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A METHOD FOR FACILITATING ACCESS TO MULTIMEDIA CONTENT 

Field of the Invention 

The present invention relates generally to enabling access to electronically- 
accessible multimedia content and, in particular, to searching access which is mediated 
5 through a user interface. 

Background 

As network connectivity has continued its explosive growth, content providers are 
using the World Wide Web (the "Web") to provide access to their multimedia content (eg. 
images, video, audio, etc.). Unlike textual content, such as HTML pages, multimedia 

10 content is not directly accessible to standard Web search engines. These search engines 
can examine sites on the Web and extract information about the textual content of those 
sites. Such information is typically termed "metadata" which is data that describes or 
catalogues aspects of other data. The extracted information (metadata) can then provide 
users with access to that content using their customised metadata databases. 

15 In the case of multimedia, typically content providers or distributors store 

information about the multimedia items to which they have access in metadata databases. 
The content providers then enable access to these databases by providing a search engine 
that users or customers can access from a Web site, typically the content 
provider/distributor's own Web site. Customers wanting to purchase, or maybe view, 

20 content that a content provider/distributor has access to, can visit v the Web site and use the 
search engine to search the content provider/distributors metadata database. Typically the 
metadata database contains visual identifiers of the content (eg. thumbnails, video 
abstracts, audio previews, etc) as part of the metadata. The user can then make decisions 
about which item(s) they may wish to purchase/use based on the metadata that is returned 

25 from their searches. 

In many cases the multimedia content is digital and on-line and potential customers 
can purchase the rights to use or purchase a copy of the desired multimedia item from the 
content provider/distributor's Web site. More often than not this transaction is completed 
on the Web site and the potential customer can directly download their newly acquired 

30 content. However this model of providing access to multimedia content does not require 
that the content is on-line. For example, a potential customer might be able to purchase 
the rights to use, or a copy, of the desired content from the Web site but the content may 
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be delivered to the potential customer by non-electronic means (ie. the postal system). 
Another variation is that the potential customer may be redirected from a distributor's site 
to the actual content provider in order to purchase and acquire a copy of the desired 
content. Other variations include the potential customer being directed to a physical 
5 location to purchase the content and being posted books containing the metadata 
associated with items to be purchased. 

In all the abovementioned situations, the potential customer can only gain access to 
the content to which each content provider/distributor has access. If the potential customer 
wanted to perform a search across several different content providers/distributors, the 

10 potential customer would have to visit the Web site and use the search engine of each of 
the different content providers/distributors. This is time consuming and annoying because 
the potential customer needs to be able to use a different search engine interface each time. 

These problems have encouraged the development of very large metadata 
databases on the Web where a content distributor either purchases the rights to content or 

15 simply acts as a distributor for smaller content providers. Examples of such are the large 
image databases of Getty and Corbus. This approach has its own problems. Firstly, the 
approach does not scale because as the databases become very large, the search time 
increases. Further, typically all the metadata has to be structured in a similar fashion in 
order to contain the same metadata keys. However, such is not always desirable as 

20 different metadata may be more appropriate depending on the targeted use of the content. 
For example, images captured for geological purposes would require different metadata 
that those captured for holiday brochures. Thirdly, smaller content providers have no way 
to directly sell their content (ie. they are effectively forced to use the larger distributors). 

It is an object of the present invention to ameliorate one or more disadvantages of 

25 the prior art. 

Summary of the Invention 

In accordance with one aspect of the present invention there is disclosed a method 
of facilitating access to multimedia content distributed over a computer network, said 
method comprising the steps of: 

30 at a server within said network, monitoring location recording activities of 

subscribers of said server to form a ranked list of multimedia locations recorded by said 
subscribers; 
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at a browser coupled to said server, receiving search criteria relating to said 
multimedia content entered by a subscriber; and 

applying said search criteria to said ranked list to provide said subscriber with 
network identifiers for said multimedia content corresponding to those said recorded 
5 locations best satisfying said search criteria. 

In accordance with another aspect of the present disclosure there is provided a 
method of forming a searchable list of computer network locations, said method 
comprising the steps of: 

monitoring, at a server to a plurality of network subscribers, bookmarking 
10 activities of said subscribers; 

forming a list of network identifiers bookmarked by individual ones of said 
subscribers; and 

ordering said network identifiers in said list according to a frequency of 
bookmarking by said subscribers. 
15 In accordance with another aspect of the present disclosure there is provided a 

method of facilitating access to multimedia content distributed over a computer network, 
said method comprising, at a server within said network, the steps of: 

providing an application for subscribing users thereof to enable said users to 
access multimedia content, each item of said multimedia content being identified by a 
20 network identifier; 

monitoring individual said subscribing users' intent to use or store items of said 
multimedia content; and 

in response to said monitoring, forming a ranked list of network identifiers 
relating to said items for use in facilitating access to said multimedia content for a 
25 predetermined set of said subscribing users, wherein an ordering of said list is determined 
by a frequency that the corresponding said network identifier is referenced with intent to 
use or store by said subscribing users. 

In accordance with another aspect of the present disclosure there is provided a 
browser application for operation upon a subscriber terminal, said application comprising 
30 means for signalling to a server to which said application couples, a bookmarking of a 
location accessed by said application, said location being within a computer network 
incorporating said server. 

In accordance with another aspect of the present disclosure there is provided a 
server operating within a computer network, said server having at least one user browser 
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application associated therewith and via which said user browser application accesses 

locations within said network, said server comprising: 

means for receiving from said user browser application, bookmark information 

relating to a location recorded for subsequent access by said user browser application; 

5 means for integrating said bookmark information received from plural ones of 

said user browser applications to form a database of said bookmark information. 
Other aspects of the present invention are also disclosed. 

Brief Description of the Drawings 
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One or more embodiments of the present invention will now be described with 
reference to the drawings and appendices in which: 

Fig. 1 is a block diagram showing the operating environment of a multimedia access 
system; 

5 Fig. 2 is a more detailed block diagram showing how the media browser accesses 

metadata databases; 

Fig. 3 is a flow chart depicting the communication process between the media 
browser and the metadata server; 

Fig. 4 shows the visual appearance of the user interface of the media browser 
10 component of the multimedia access system; 

Fig. 5 is a flow chart showing the preferred browsing process of the media browser; 
Fig. 6 is a flow chart showing the preferred searching process of the media browser; 
Fig. 7 depicts a structured image metadata database; 

Fig. 8 shows and example of an XML schema for browsing media resources; 
15 Fig. 9 is a schematic block diagram representation of a computer system upon which 

the media browser may operate; 

Fig. 10 depicts an example implementation of the system of Figs. 1 to 8; 

Fig. 1 1 depicts customisation of the media browser for different devices; 

Fig. 12 shows an arrangement by which the right to use multimedia content may be 
20 controlled; 

Figs. 13 A, 13B and 13C illustrate methods by which metadata links may be 
communicated between devices; 

Fig. 14 shows an arrangement by which a current media browser session may be 
switch from one device to another; 
25 Fig. 15 depicts an example of how a source description can be transformed into a 

normalised description that is presentable by the media browser arrangement, 

Fig. 16 is a screen dump of a preferred media browser graphical user interface; 

Figs. 17A and 17B illustrate how "breadcrumb" navigation is used in the interface of 
Fig. 16; 

30 Fig. 18 is a depiction of a modified graphical user interface; 

Appendix 1 is an XML source description for the example of Fig. 15; and 
Appendix 2 is an XML stylesheet formed from the source description of Appendix 1 . 
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Detailed Description 

I. Overview 

The operating environment of a multimedia access system is shown in Fig. 1 . A 
computer system 100 is shown in which a computer application program, hereinafter 
5 called a media browser 101, operates on a local computer 105 to form a connection to a 
computer network, such as the Internet 102. As illustrated, the Internet 102 has associated 
therewith a number of server computers 108 and 109, each of which may host a number of 
Web sites and for each of which there is a corresponding store 112 and 114 in which 
multimedia content may be retained. The local computer 105 similarly has an associated 

10 store 107. The media browser application 101 provides a single user interface for a user to 
browse and search the system 100 for multimedia items using electronically-accessible 
metadata. In other words, the media browser 101 operates on metadata. Any 
playing/viewing of multimedia content is achieved by the use of plug-in media tools and is 
separated from the metadata-related processing. The media browser 101 is described in 

15 more detail in Section IV below. 

The described arrangements may practiced using a general-purpose computer 
system 900, such as that shown in Fig. 9 wherein the processes of Fig. 1 and to be 
described are implemented as software, such as an application program executing within 
the computer system 900. In particular, the method of media browsing is effected by 

20 instructions in the software that are carried out by the computer system. The software may 
be divided into two separate parts; one part for carrying out the browsing methods and 
another part to manage the user interface between the latter and the user. The software 
may be stored in one or more computer readable media, including the storage devices 
described below, for example. The software is loaded into computers of the system from 

25 the computer readable media, and then executed by the computers. A computer readable 
medium having such software or computer program recorded thereon is a computer 
program product. The use of the computer program product in a computer preferably 
effects an advantageous apparatus for media browsing. 

The computer system 900 comprises a computer module 901, input devices such as a 

30 keyboard 902 and mouse 903, output devices including a printer 915 and a audio-visual 
output device 914. A Modulator-Demodulator (Modem) transceiver device 916 is used by 
the computer module 901 for communicating to and from a communications network 920, 
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for example connectable via a telephone line 921 or other functional medium. The 
network 920 may for example be the Internet, and/or other network systems, such as a 
Local Area Network (LAN) or a Wide Area Network (WAN). Collectively, the devices 
901 - 916 may form for example either one or any of the local computer 105 or server 
5 computers 108 and 109 of Fig. 1 and are often are described as computer workstations 

The computer module 901 typically includes at least one processor unit 905, a 
memory unit 906, for example formed from semiconductor random access memory 
(RAM) and read only memory (ROM), input/output (I/O) interfaces including a audio- 
visual interface 907, and an I/O interface 913 for the keyboard 902 and mouse 903 and 

10 optionally a joystick (not illustrated), and an interface 908 for the modem 916. A storage 
device 909 is provided and typically includes a hard disk drive 910 and a floppy disk 
drive 911. A magnetic tape drive (not illustrated) may also be used. A CD-ROM 
drive 912 is typically provided as a non- volatile source of data. The components 905 
to 913 of the computer module 901, typically communicate via an interconnected bus 904 

15 and in a manner which results in a conventional mode of operation of the computer 
system 900 known to those in the relevant art. Examples of computers on which the 
described arrangements can be practised include IBM-PC's and compatibles, Sun 
Sparcstations or alike computer systems evolved therefrom. 

Typically, the application program is resident on the hard disk drive 910 and read 

20 and controlled in its execution by the processor 905. Intermediate storage of the program 
and any data fetched from the network 920 may be accomplished using the semiconductor 
memory 906, possibly in concert with the hard disk drive 910. The audio-visual output 
device 914 may be used to provide a graphical user interface to the application program by 
which user input may be afforded via the keyboard 902 and by clicking buttons on the 

25 mouse 903 as a mouse-cursor is manoeuvred across the interface represented on the audio- 
visual output device 914. In some instances, the application program may be supplied to 
the user encoded on a CD-ROM or floppy disk and read via the corresponding drive 912 
or 911, or alternatively may be read by the user from the network 920 via the modem 
device 916. Still further, the software can also be loaded into the computer system 900 

30 from other computer readable medium including magnetic tape, a ROM or integrated 
circuit, a magneto-optical disk, a radio or infra-red transmission channel between the 
computer module 901 and another device, a computer readable card such as a PCMCIA 
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card, and the Internet and Intranets including e-mail transmissions and information 
recorded on websites and the like. The foregoing is merely exemplary of relevant 
computer readable media. Other computer readable media may be practiced without 
departing from the scope and spirit of the invention. 
5 Returning to Fig. 1, the metadata used by the media browser 101 can be accessed 

directly from the local computer 105 or from any accessible site on the Internet 102 such 
as the server 108. Typically the metadata for a collection of multimedia content is stored 
in collections (eg. repositories or databases) with each item of content having at least one 
corresponding metadata item. As seen in Fig. 1, each store 107, 112 and 114 has 

10 associated therewith a corresponding database 106, 110 and 111 respectively, which is 
configured to retain metadata items to facilitate access to the content within the 
corresponding respective store 107, 112 and 114. Hereinafter, a metadata item is referred 
to as a description of its corresponding item (typically, of content) and the term metadata 
collection refers to collections of such descriptions. 

15 In the preferred arrangement, the media browser 101 is able to access the metadata 

without having to access the content (107, 112, 114). In other words, a metadata 
description is not stored as an integral part of an item of content. This means that the 
media browser 101 does not need to be able to directly interpret the large number of 
storage/transport formats for audiovisual content in order to access metadata. 

20 The media browser 101 assumes that each description (in the databases 106, 110 and 

111) has a link to its corresponding content (107, 112, 114). If the content is stored 
electronically, these links can be actuated or electronically followed (eg. 120, 115, 1 16) by 
a user or process. Alternatively links, such as the link 118, can describe a route to a non- 
electronic location (eg. a film archive). Non-electronic links are not active (ie. unable to 

25 be followed by a remote user or process) and hence are only informative of available 
content. Accordingly, with such non-electronic links, remote users do not have a capacity 
to preview content using the media browser 101 . 

The media browser 101 requires that the metadata can be expressed in a standard 
manner. In the preferred arrangement, the structure of individual descriptions are 

30 determined by a schema. Descriptions of different items of content can use different 
schema. Typically the schema used reflects the type of content and the typical use or 
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purpose of the content. For example, a metadata schema for geological satellite images 
would most likely be significantly different to a schema for digital home video. 

Schema may differ in their syntactical structure and the nature of the types of 
description components (hereinafter called descriptors). For example, a schema for digital 
5 home video may model descriptions of this type of content to contain a digital video tape, 
which contains one or more scenes, each of which contain one or more clips or shots. The 
geological satellite image schema may simply have a number of descriptors, with a 
particular geological focus, which are used to describe each image. In the preferred 
arrangement schemas are represented using the W3C Extensible Markup Language (XML) 

10 Schema language and individual descriptions are represented as XML documents. The 
metadata representation is described further in Section n. 

Fig. 2 shows an example of how a media browser 101 can access metadata over the 
Internet 102. All access to metadata is achieved using links with the target of each link 
being expressed as a Uniform Resource Identifier (URI). These links can be actuated 

15 either automatically by the media browser 101 or in response to a user action (eg. clicking 
on an item of interest). 

In the case where the metadata is stored in an XML repository (collection of XML 
documents) 200, the media browser 101 can provide access to the metadata stored in the 
repository 200 using a link to an XML description of the repository 200. This description 

20 represents the structure of the repository 200 that is presented to a user of the media 
browser 101. The XML description is represented in the same way as a description of a 
multimedia item of content. In other words, the description must conform to an XML 
schema that is accessible to the media browser 101, and which describes the structure of 
the repository 200. The XML description can contain links to other descriptions of 

25 particular sections of the repository 200, so that the description of the repository 200 does 
not need to be contained within a single XML document). Ultimately the repository XML 
description has links to descriptions of multimedia items. Each description of a 
multimedia item in the repository 200 contains a link 201 to a corresponding multimedia 
item in the corresponding content collection 202. This enables the media browser 101 to 

30 be able to retrieve these items if a user or customer selects to view or play the item based 
on the presented metadata. 
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In the case where access to a non-XML repository, here called a legacy database 210, 
is desired, the link described above with reference to Fig. 1 must operate through a server 
module called a metadata server 212. The metadata server 212 is typically located, though 
not necessarily, at the site of the metadata (ie. either local or remote) and is configured and 
5 controlled by the owner of the metadata. The purpose of the metadata server 212 is to 
effectively translate the metadata stored in a legacy database 210 to the format required by 
media browser 101. In other words, the metadata server 212 must provide one or more 
schemas for the metadata and XML descriptions that conform to these schemas. 
Typically, a metadata server 212 will only need to provide a schema that describes the 

10 structure/syntax of the metadata collection and a further schema that describes the 
structure/syntax of the descriptions stored in the legacy database 210. As with the case 
where the remote metadata is stored in an XML repository 200, the descriptions of 
multimedia items, that the metadata server 212 generates, contain links to the 
corresponding multimedia items stored in a content collection 214 corresponding to the 

1 5 legacy database 210. 

A link to a metadata server is also represented using a URL A query string is 
appended to the URI to contain the metadata server request. The media browser 101 
interprets the link by first using the identifier part of the URI to locate the appropriate 
metadata server on the network. The media browser 101 then forwards the query string to 

20 the located metadata server for processing. Processing of the query results in descriptions 
of either the structure of the collection or multimedia items depending on how the 
metadata server 212 interprets the query string. 

The descriptions that are dynamically generated by the metadata server 212 can be in 
response to media browser user browsing or search requests. Metadata servers are 

25 discussed further in Section III below. 
II. Metadata Representation 

The preferred arrangement assumes that all descriptions of multimedia items 
conform to a schema, and that schemas are expressed or represented using XML Schema. 
Individual descriptions are represented using XML document instances. XML Schema are 

30 also represented as XML documents. Therefore descriptions (eg. of multimedia items) can 
be stored along with their respective schemas in XML repositories or object stores. 
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Alternatively, the descriptions can be stored in a database and effectively translated into 
XML documents when required. 

Each description contains a reference to the schema to which it conforms. The 
reference is expressed using a URI (eg. http://somesite/schemas/DigitalVideoSchema.xsd). 
5 This means that once a media browser has access to a description it can directly access to 
the schema to which the description conforms. 

In this document, the term "descriptor" is used to refer to a component, or atom, of a 
description. Each descriptor comprises a feature (descriptor name) and a value 
(description value). In some cases, the descriptor value comprises other descriptors, and 
10 thus may form a "complex descriptor". In other cases, the descriptor value is a scalar 
value such as a string or date (ie. simple or atomic descriptors). In all cases media 
browser 101 assumes that descriptors are represented with the element (tag) name being 
the descriptor name and the content of the element being descriptor value. For example, a 
simple descriptor may use the textual content of the element (ie. the text between the tags) 
15 to represent the value of the descriptor (eg. a date, text string, enumeration, etc.). 

This assumption about the structure of the metadata is not unlike how many 
practitioners currently use markup languages. In other words, it does not require 
significant changes from how practitioners might represent particular metadata 
vocabularies. 

20 Some examples of descriptors are now provided. In the simple descriptor, 

<Photographer>John Smith</Photographer>, Photographer is the name of the 
descriptor and John Smith is the value of the descriptor. The type of the text of a simple 
descriptor can be constrained using the simpleType construct of XML Schema. 

In the example shown in Fig. 8, both VideoScene and Clip are complex descriptors. 

25 The value of the VideoScene descriptor is the markup that is contained within the start 
and end tags of the descriptor. The name of the descriptor is the tag name (ie. 
VideoScene). Similarly the value of the Clip complex descriptor is that markup 
contained between the start and end tags of the Clip descriptor. The Clip descriptor value 
contains two simple descriptors, Date and Location. The value of the Location 

30 descriptor is the text contained between the start and end Location tags (ie. Sydney, 
Australia). 
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In order to be able to better interpret the basic semantics of descriptions for the 
purposes of visually presenting descriptions in a meaningful way to users, the preferred 
arrangement includes a core schema which contains definitions of a number of basic 
attributes that description schema designers can use when they define their descriptors. An 
5 example of the definitions included in this core schema are shown below as Example A, in 
which only a fragment of the actual schema is shown. 



Example A 





1. 


<simpleType name = 'DescriptorType' source = 'string'> 


10 


2. 


enumeration value = TOC7> 




3. 


enumeration value = 'Index7> 




4. 


.enumeration value = 'Other'/> 




5. 


</simpleType> 




6. 


<attributeGroup name = 'DescriptorAttributes'> 


15 


7. 


<attribute name = 'id' type = 'ID' minOccurs = '0' maxOccurs = T/> 




8. 


ottribute name = 'textldentifier' type = 'string' minOccurs = '0' 




9. 


<attribute name = Visual Identifier' type = 'uriReference' 

minOccurs = '0' maxOccurs = T/> 


20 


10. 


</attributeGroup> 




11. 


<attributeGroup name = TOCDescriptorAttributes f > 




12. 


<attributeGroup ref = *DescriptorAttributes7> 




13. 


<attribute name = 'descriptorType' type = 'DescriptorType' use = 'fixed' 

value = TOC7> 


25 


14. 


</attributeGroup> 




15. 


<attributeGroup name = 'lndexDescriptorAttributes'> 




16. 


<attributeGroup ref = 'DescriptorAttributes7> 




17. 


<attribute name = 'descriptorType' type = 'DescriptorType' use = 'fixed' 

value = 'Index7> 


30 


18. 


</attributeGroup> 



The attribute descriptorType is used to define whether a descriptor is to be treated 

s 

as part of a Table of Contents (TOC descriptor), or as part of an index (index descriptor). 
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A TOC descriptor is used to describe the structure of a description, and is typically a 
complex description. A TOC descriptor is navigable in the sense that such must contain a 
link within either its attributes or within the attributes of its children. The target of the link 
can be either a further description or an item of content. A TOC descriptor is similar to an 
5 entry in a table of contents of a book in that it enables a reader to go directly to a section of 
the work. 

Index descriptors are typically leaf nodes of a hierarchically-composed descriptor 
structure and are often referred to as properties (ie. the type of descriptive information that 
is displayed using a properties dialog in a Microsoft Windows (registered trade mark) 

10 system (eg. the Date and Location descriptors in the above example). Section IV below 
describes how this attribute is used by the media browser. 

Attributes are also used to contain visual and/or textual identifiers for a descriptor. 
A visual identifier (ie. visual Identifier attribute) can be the URI of a thumbnail or 
movie/audio track preview. A text identifier (ie. textldentifier attribute) can be used in 

15 the place of, or in addition to, a visual identifier. A text identifier typically contains a 
string value which describes the descriptor. In the absence of a visual identifier, the media 
browser can construct a visual representation based on this text value. These core 
attributes "drive" the user interface of the media browser. In other words, they have been 
included for presentation purposes. 

20 In addition to the core visualisation attributes that are defined in the core schema, the 

preferred arrangement uses the linking attributes of the evolving W3C XLink standard (as 
described at http://www.w3.org/TR/2000A/VD-xlink-20000221/ ) to provide linking 
semantics. XLink provides a framework for creating both basic unidirectional links, such 
as the HTML <A> linking element, and more complex linking structures. Simple linking 

25 elements are a common linking requirement for the preferred arrangement. These links 
can be used to represent links between two descriptors (ie. items of metadata) and links 
between descriptors (metadata) and content (eg. images, video, etc.). XLink also provides 
for other linking types such as extended links, locators and arc. The full list of linking 
types is described at http://www.w3.orgrTR/2000/WD-xlink-20000221/ . 

30 The existence of a link, using XLink, is asserted by an XLink linking element. 

These elements need to be able to recognised by applications in order to provide 
appropriate display or behaviour. XLink uses a namespace to accomplish link recognition. 
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The XLink namespace used by the preferred arrangement has the URI, 
http://www. w3.org/1999/xlink, and is associated with the xlink prefix. This association 
is achieved using the xmlns attribute of XML (eg. xmlns:xlink = 
< http://www.w3.org/1999/xlink , ). XLink's namespace provides definitions of global 
5 attributes that can be used on elements that are in any arbitrary namespace. These global 
attributes (xlink:type, xlinkihref, xlink:role, xlinkititle, xlinkrshow, xlinkiactuate, 
xlink:from and xlink:to) can be used to make elements recognisable as linking elements. 
For example, if the value of the xlink:type attribute is set to simple for a particular 
element, then that element is treated as a simple linking element and the value of the 
10 attribute, xlinkihref, contains the target of that link. For the purposes of this description, 
definitions of the linking attributes using XML schema are included below in Example B. 



Example B 



1 . <?xml version-1 .0'?> 

15 2. <!- XML Schema schema for Xlink draft dated 21 February 2000~> 

3. <!DOCTYPE 

4. schema PUBLIC "-//W3C//DTD XMLSCHEMA 19991 21 6//EN" 

"XMLSchema.dtd" > 

5. <schema 

20 6. xmlns = 'http://www.w3.org/1999/XMLSchema' 

7. targetNamespace = 'http://www.w3.org/1999/xlink' 

9. elementFormDefault = 'qualified 1 

9. version = '1.0'> 

10. <simpleType name = 'LinkType' base = 'string'> 
25 11. enumeration value = 'simpley> 

12. enumeration value = 'extended7> 

13. enumeration value = 1 locator7> 

14. enumeration value = 'arc'/> 

15. enumeration value = *resource7> 
30 16. enumeration value = 'title'/> 

17. enumeration value = 'none'/> 

18. </simpleType> 

19. <simpleType name = 'ShowType' base = 'string'> 

20. enumeration value = 'new7> 
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21. 


enumeration value = YeplaceV> 


22. 


enumeration value = 'embed7> 


23. 


enumeration value = 'undefined'/> 


24. 


</simpleType> 


25. 


<simpleType name = 'ActuateType' base = 'string'> 


26. 


<enumeration value = *onl_oad7> 


27. 


<enumeration value = 'onRequest7> 


28. 


enumeration value = 'undefined7> 


29. 


</simpleType> 


30. 


ettribute name = 'type 1 type = 'LinkType7> 


31. 


ettribute name = 'href type = 'uriReference' minOccurs = '1' 




maxOccurs = 'unbounded'/> 


32. 


ettribute name = 'role'. type = 'QName' minOccurs = '0' maxOccurs = T/> 


33. 


<attribute name = 'title 1 type = 'string' minOccurs = '0* maxOccurs = T/> 


34. 


ettribute name = 'show* type = 'xlinkiShowType* minOccurs = '0' 




maxOccurs = T/> 


35. 


<attribute name = 'actuate' type = 'ActuateType' minOccurs = '0' 




maxOccurs = T/> 


oo. 


^aiiNDUie nam© — xiiriirurn iypc — ui irvt?it?ici lot? iiiiiiv^ouuio vj 




maxOccurs = T/> 


37. 


ettribute name = 'to' type = 'uri Reference' minOccurs = '0' 




maxOccurs = T/> 


38. 


</schema> 



25 A particular schema can use the core XLink and media browser attributes when 

declaring individual descriptors for a schema. In Example C below, the particular 
descriptors VideoClip, Date and Photographer are declared for a particular schema. 
Note that only a fragment of an actual schema is shown and access to both the core media 
browser and XLink schemas is assumed via the namespace prefixes mb and xlink, 

30 respectively. Preferably these schemas are assigned the abovementioned namespace prefix 
using the xmlns attribute of XML schema. The media browser attributes are referenced 
unchanged from their definitions as seen at line 13 and TOCDescriptorAttributes. 
However the XLink attributes are referenced, for example as seen at line 14, and are 
further restricted from their original definitions. For example, the VideoClip descriptor is 
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a simple linking element so the xlink:type attribute's value can be fixed as simple. With a 
simple link, the element (descriptor) is the link source and a single linkend must exist. 
This single linkend is represented using the xlink:href attribute. A value must be supplied 
for this attribute for the simple link to be valid (hence the use constraint for this attribute 
5 is set to required). 



Example C: 



10 



15 



20 



25 



1. 

2. 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 



<element name = VideoClip^ 
<complexType> 

<element name = 'Date'> 

<complexType base = 'Date' derivedBy = 'extension^ 

<attributeGroup ref = 'mb:lndexDescriptorAttributes7> 
</complexType> 
</element> 

<element name = Photographer 1 > 

<complexType base = 'String' derivedBy = 'extension^ 

<attributeGroup ref = 'mbilndexDescriptorAttributes'^ 

</complexType> 
</element> 



Al 



<attributeGroup ref = 'mbiTOCDescriptorAttributes'^ 
<attribute ref = 'xlink:type' use = 'fixed 1 value = 'simple7> 
<attribute ref = Wink^iref use = 'require d7> 
<attribute ref = 'xlinkirole^tise = 'fixed 1 va A2 'esourceV> 
</complexType> 
</element> 



A description conforming to this particular schema fragment may contain the 
fragment of Example D: 



30 Example D 

1 . <VideoClip xlink:href = < http://someSite/content/video/clip999.mpg , > 

2. <Date>2000-04-1 8</Date> 

3. <Photographer>John Smith</Photographer> 

4. <A/ideoClip> 
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In the preferred arrangement, the core media browser attributes are explicitly 
expressed in the descriptions, and in schemas. Alternative arrangements can infer these 
attribute values from other information in descriptions, as described below. For example, 
5 a descriptor/element may be treated as part of the TOC if it contained a link within either 
its attributes or within the attributes of its children. Further, descriptors which do not have 
descendant links may be treated as index descriptors. Similarly visual identifiers may be 
automatically constructed from element (descriptor) names. 
Interpretation of Metadata 

10 In practice, not all the metadata that a user wishes to visualise using the media 

browser 101 will be represented in a core format known to the media browser 101 and the 
XLink attributes described above. On parsing a new description, the media browser 101 
first attempts to identify the type of metadata that has been received, examples of which 
may include Dublin Core, MPEG-7 or DIG35 for images, each of these being known in 

15 the art. Typically, this can be achieved by examining either the root element of the 
description or the namespace declarations. If the media browser 101 identifies a metadata 
standard, then the media browser 101 operates to invoke an XSLT stylesheet to transform 
the incoming document tree into one that explicitly uses the media browser and XLink 
attributes. No further processing is required. In other words, it is assumed that the 

20 transform results in a description that the media browser can present without further 
processing. 

For all other descriptions, before the parsed description is passed, as an XSDOM, to 
the calling method, a check is performed to attempt to ensure that the preferred media 
browser attributes are present. In one implementation, this check uses a list of rules for the 
25 creation of appropriate media browser attributes for the incoming metadata, if they are 
absent. The rules are as follows: 

(i) An href attribute is assumed to be a link and represented as an xlink:href 
attribute. If the target value of the link is a URI with an extension of XML or 
no extension, then a link to another description is assumed (ie. xlink:role is set 
30 to 'description'), otherwise the link is assumed to be a link to the relevant 

content (ie. xlink:role is set to 'resource'). The type of the link is assumed to 
be simple (ie. xlink:type is set to 'simple'). 
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(ii) An element is classified as a TOC descriptor if either the descriptor or 
any of its children contains a link (ie. mb:descriptorType is set to 'TOC'). 
The link may be represented in the original metadata as element content or an 
attribute. An element not classified as a TOC descriptor is assumed to be an 
Index descriptor. 

(iii) If a descriptor does not have a visualldentifier or a textldentifer then a 
textldentifer is created either from a name attribute of the descriptor, if it 
exists, or from the element name. In this regard, the media browser 101 
preferably always displays a visualldentifier if one exists, otherwise the 
textldentifer is used. 

Whilst the above lists only three rules, it will be appreciated that alternate and/or 
additional rules may be developed to provided for meaningful interpretation of unknown 
metadata types. 

However, the use of an XSLT stylesheet is a desired approach because a priori 
knowledge of the metadata format enables a stylesheet author to define informed 
transforms. For example, the value of the visualldentifier attribute may be taken directly 
from the value of another attribute. An example of a transform for some arbitrary video 
metadata that is based on a subset of known extended Dublin Core attributes to a form 
useable by Media Browser is shown in Fig. 15. 

An XSLT transform 1528 of Fig. 15 is configured with knowledge of the syntax and 
semantics of a source description 1580 for a video document 1500 having a title 
attribute 1502. For example, the shown transform assumes that the value of the 
DC. Identifier attribute 1510 of the source Scene elements 1504, 1506, 1508, and the 
DC. Identifier attribute 1518 of the Shot elements 1512, 1514, 1516 is just a reference 
identifier and does not provide additional information. For this reason, the transform uses 
these references as the values of the mb:id attribute. If these identifiers did carry 
significance to the user of the metadata then these attributes could have been transformed 
into index descriptors as, for example, the DC. Description attribute of the Scene 
element 1510. Note also that in Fig. 15 the transformed description does not maintain the 
initial frame granularity of the source description. This represents a decision made by the 
designer of the stylesheet 1528 which typically operates with knowledge of the media 
browser interface 101. 
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In the example of Fig. 15, it may initially appear counterproductive to transform a 
description that uses elements to represent structure and attributes to represent properties, 
into an element tree. However concepts of what information should be represented as 
attributes and what information should be represented as content often vary with media 
5 type, as described above. For this reason, transforming source metadata into an element 
tree is a form of normalising the metadata, and the transform 1528 thus results in a 
normalised description 1590 able to be processed and presented by the media browser 101. 

In Fig. 15 the source and transformed descriptions are depicted as element node trees 
with attributes shown in the boxes to the right of the corresponding node. The notation 

10 {att_name} is used to denote the value of the attribute of the corresponding element in 
the source document with the name att_name. The avptr notation is a method of 
addressing into audiovisual content using XPointer fragments, and is known in the art. 

The source description 1580 is an XML document seen in Appendix 1. The media 
browser 101 does not attempt to transform any relevant schema, if one exists. 

15 Consequently, the transformed description does not conform to a schema and therefore the 
description cannot be annotated. This is emphasised in the transformed description by 
setting the updateable attribute of the media browser 101 to false in the root 
element 1532 of the transformed description 1590. The XSLT stylesheet used to achieve 
the transform 1528 is seen in Appendix 2. In an alternative arrangement, a core 

20 Descriptor type may be defined. This core type may then be extended in definitions for a 
TOCDescriptor and IndexDescriptor. Schemas may then be created that contain 
definitions of new descriptor types that extend these basic types. In other words, type 
extension can thus used as an inheritance mechanism for the core attributes that are 
required by media browser. An example of these definitions is shown below as Example 

25 E. Note that the definitions use a similar set of attributes to those defined for the preferred 
arrangement with the exception that this example does not use the XLink attributes. 
Instead, Example E assumes that all links will be simple links (have a single source and 
destination with the source being the element that contains the href attribute). 

30 Example E 

1 . <simpleType name = 'DescriptorType* source = 'string'> 

2. enumeration value = TOC7> 
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3. 


enumeration value = *lndex'/> 






4. 


</simpleType> 






5. 


<complexType name = "Descriptor 1 > 






6. 


<attribute name = 'id' type = ID" minOccurs = XT maxOccurs = T/> 


5 


7. 


<attribute name = 'href type = 'uriReference' minOccurs = 


'0' 






maxOccurs - T/> 






8. 


<attribute name = Visualldentifier' type = 'uriReference' 








minOccurs = 4 0' maxOccurs = 


T/> 




9- 


<attribute name = 'type' type = 'DescriptorType' use = 'default 1 


10 




value = 4 lndex7> 






10. 


<attribute name = value/> 






11. 


</complexType> 






12. 


<element name = 'Descriptor' type = 'Descriptor'/> 






13. 


<complexType name = 'TOCDescriptor' base = 'Descriptor' 




15 




derivedBy = 'restriction^ 






14. 


<attribute name = 'type' use = 'fixed' value = TOC7> 






15. 


</complexType> 






16. 


<element name = TOCDescriptor* type = TOCDescriptor7> 






17. 


<complexType name = 'IndexDescriptor* base = 'Descriptor 1 




20 




derivedBy = 'restriction' > 






18. 


<attribute name = 'type' use = 'fixed' value = 'lndex'/> 






19. 


<attribute name = 'href type = 'uriReference' use = 'prohibited' /> 




20. 


</complexType> 






21. 


<element name = 'IndexDescriptor' type = 'lndexDescriptor7> 





25 



III. Metadata Servers 

A link to a metadata server 212 is represented using a URL An expression 
describing the request is appended to a URI that uniquely identifies the metadata 
server 212. For example, the URI: http://somesite/myMetadata/Svr?<query_string>, 
30 has an identifier component which is the part of the URI preceding the question mark 
symbol and a request component which carries information about the request to be sent to 
the metadata server 212. The identifier component is itself a URI. 

The preferred arrangement interprets the link by first using the identifier part of the 
URI to locate the metadata server 212 on the network 102. Failure to identify the metadata 
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server 212 results in a failed link and the media browser 101 user can be notified of the 
failure to detect a running process. In the preferred arrangement, the metadata server 212 
must be running as a process and the process being run by the metadata server 212 cannot 
be initiated from the media browser 101. In alternative arrangements, the media 
5 browser 101 may be configured to initiate the server process. 

When an identified metadata server 212 receives a request, the server 212 interprets 
the request and replies with an XML description that satisfies the request. The description 
is sent as clean text, however the description may be encoded if desired or necessary. The 
types and elements used in the description need to be defined in a schema that the media 

10 browser 101 can access. Although, the descriptions are not validated against their schema 
by the media browser 101 in the described arrangement, the media browser 101 requires 
access to the schema. Preferably, the types and elements of the schema used by the 
metadata server 212 are derived using the core attributes defined above in Section EL 

The requests directed at the metadata server 212 may be for metadata required for 

15 browsing or a search expression. The request can also specify various parameters that 
control the delivery of the XML back to the requesting media browser service. 

The results of requests that are directed at a metadata server 212 are descriptions 
which are preferably contained in an element, either of type, or derived from the type 
MetadataCollection, an example of which is provided below as Example F. The 

20 MetadataCollection type provides a means for the metadata server to explicitly return 
information to the requesting media browser application or service (eg. the number of 
items that satisfy the request and the number of items that are actually returned in the 
description). 



Example F 



25 


1. 


<complexType = 'MetadataCollection 




2. 


<any minoccurs = *0' maxOccurs ='unbounded'/> 




3. 


<attribute ref = mbrdescriptorType use = 'fixed' value = 'other7> 




4. 


<attribute name = YequestlD' type = 'string 1 minOccurs = T 

maxOccurs = T/> 


30 


5. 


<attribute name = 'noltemsldentified' type = 'integer 1 minOccurs = T 

maxOccurs = T/> 




6. 


<attribute name = 'noltemsReturned' type = 'integer* minOccurs = '1' 

maxOccurs = T/> 
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7. <attribute name = 'startltemReturnecT type = Integer 1 minOccurs = '1' 

maxOccurs = T/> 

8. </complexType> 



5 Before details of the request syntax are described, the overall processing model for 

the communication performed by the media browser 101 with a metadata server 212 is 
described with reference to the flow chart of Fig. 3. Firstly, in step 300, the metadata 
server 212 is identified from the URL The request is then sent to the identified metadata 
server 212 in step 301. The system then waits in step 302 for a reply. A check is 

10 performed in step 303 to see if a reply has been received. If not, then the waiting period is 
compared with a predetermined timeout in step 304 and if the waiting period is not greater 
than the timeout, control passes back to step 302. If the waiting period is greater than the 
timeout, an error is reported to the media browser user in step 306 and the process 
terminates in step 310 (ie. the metadata server 212 has not been reached for some reason). 

15 If a reply is received in step 303 the media browser 101 examines the response. If 

the media browser 101 cannot process the response (eg. the response is not correctly 
structured) then an error is reported in step 306 and the process terminates in step 310. If 
the response is able to be processed (ie. parsed) then it is passed to the appropriate module 
in the media browser 101 for further use and the process terminates in step 310. 

20 The request syntax will now be discussed in more detail. 

Typically most legacy databases store metadata in relational databases and access 
these databases using Standard Query Language (SQL). On the other hand, XML 
documents, and hence the media browser 101, represent information (metadata) in an 
hierarchical fashion. The metadata server 212 request must provide a bridge between the 

25 two different representations. Although it may be simpler to implement metadata servers 
if the request was based on SQL, the media browser 101 uses XML-related technology. In 
particular, the metadata server request is based on the W3C Recommendation XPath 
Version 1 .0, which may be found at http://www.w3.org/TR/xpath. 

XPath provides an extremely understandable way to describe a class of nodes to 

30 process. It is declarative rather than procedural and uses a simple pattern syntax modelled 
after directory notation. The most common form of XPath expressions are location paths. 
A location path selects a set of nodes relative to a context node. A location path can be 



535245.doc 



-21 - 



absolute (starts with a V to denote the root node) or relative (to a context node). For 
example, the expression book/author is a relative location path which selects all author 
children of book children of the context node. The XPath syntax is most easily understood 
by way of examples and examples are provided at http://vw\AA^ w3.org/TR/xpath . A 
5 number of XPath examples are as follows: 



(i) /* selects all the children of the root node 

(ii) /doc/chapter[5]/section[2] selects the second section of the fifth chapter of 
the doc. 

10 (iii) */para selects all the para grandchildren of the context node 

(iv) para[@type="warning"] selects all the para children of the context node that 
have a type attribute with the value of warning. 

(v) chapter[title- Introduction"] selects the chapter children of the context node 
that have one or more title children with string value equal to Introduction. 

15 

The location path syntax of XPath is directly useable for representing browsing 
requests and also for structured queries. In order to package unstructured queries (search 
expressions) as requests to the metadata server, XPath's function notation is used. This 
requires a more detailed understanding of XPath. 
20 The primary syntactic construct in XPath is the expression. An expression is 

evaluated to yield an object which is one of the following four basic types: 

• Node-set (an unordered collection of nodes without duplicates); 

• Boolean (true or false); 

• Number (a floating point value); and 
25 • String. 

A location path, as discussed above, is a special case of an XPath expression. A 
location path returns the set of nodes selected by the path. The part of the location path 
that is enclosed by square brackets '[]' is called the predicate. The predicate is itself an 
XPath expression which returns a Boolean result which serves to filter the node set 
30 selected with respect to the defined axis (tree relationship between the nodes selected and 
the context node) of the location step. 
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An expression can also be a function call, which, optionally, takes arguments. The 
EBNF (Extended Backus Naur form) definition of a function call is taken from Section 3.2 
of the above referenced W3C Recommendation found at http://www.w3.org/TR/xpath: 
FunctionCall ::= FunctionName '(' (Argument ('.' Argument )*)?') 1 
5 Argument ::= Expr 

Note the production Expr is the basic construct of XPath. A core function library 
exists which must be implemented by XPath implementations. Each function in the 
library is specified using a function prototype which gives the return type, the name of the 
function and the type of arguments. Although no core functions exist that can be used to 
10 pass the request to perform an unstructured query, it is trivial to extend XPath by defining 
a user function. 

Therefore, the syntax for requests is based on XPath with additional functionality to 
specify parameters that control the transmission of metadata to media browser. The syntax 
is detailed below using EBNF: 
15 Request ::= XPathExpression ('+' ParameterList )? 

ParameterList ::= Maximumltems ? ('&' Startltem )? ('&' 

NumberLevels )? C& TransactionlD )? 
Maximumltems ::= 'maxltems- Number 
Startltem ::= 'startltem- Number 
20 NumberLevels ::= 'noLevels- Number 

TransactionID ::= requestlD=Nmtoken 
Number ::= Digit (Digit)* 
The Request contains a single XPathExpression followed by an optional 
ParameterList. The XPathExpression matches the production LocationPath of the 
25 XPath Version 1.0 described at http://www.w3.org/TR/xpato with the exception that the 
predicate expression must support the following additional function call: 

Function: Boolean query(unstructuredQuery) 
This function can be included in a location path and be used to request that the 
metadata server 212 pass the unstructured query on to a search engine associated with the 
30 database 210. For example, the location path /Lifestyles/images[query("surfing 
tropics")] would therefore be interpreted by a metadata server 212 as finding all those 
images that are children of the Lifestyles node that satisfy the unstructured query "surfing 
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tropics". Note that the expression retains character spaces because the metadata server 
212 will know to take the content between the parentheses 0 as the query. 

Both Nmtoken and Digit mentioned above are defined in the XML Version 1.0 
Recommendation (see http://vAWW.w3.org/TR/1998/REC-xml-19980210 ). 
5 The ParameterList component of a Request is optional. ParameterList contains 

the optional individual productions Maximum Items, Startltem, NumberLevels and 
Transaction ID which specify the maxltems, startltem, noLevels and requestID 
parameters, respectively. If any these parameters are not specified then the media 
browser 101 uses a default value. 

10 The parameter maxltems refers to the maximum number of items to be returned by 

the metadata server 212. So, for example, if a particular section of a collection contained a 
large number of items then media browser could request the first, say (n = 101) items. The 
default value is specified by the user within the media browser 101. This parameter is 
automatically inserted into the Request by the media browser 101. If the user does not 

15 specify a value, a system default is used (eg. maxltems = 100) 

The startltem parameter allows the media browser 101 to get the next n items 
starting from a specified item number. The startltem parameter is useful in retrieving 
search results from a metadata server 212. If it is not specified in a URI, then a value 
of * r is assumed by the metadata server 212. 

20 The parameter noLevels enables the media browser 101 to define the structure of 

the returned description. Typically a single (hierarchical) level of description is required, 
however more levels may be desirable in the event of a user requesting a particular view 
that contains more than one level of the hierarchy (eg. scenes and clips for a video). If this 
parameter is not specified then a value of one (hierarchical) level is assumed. 

25 The requestID parameter allows a request to be formulated that refers to a previous 

request. For example, it may be desirable to obtain the next set of items from a previous 
request. If a requestID is specified then the metadata server 212 will attempt to reply 
using the previous request that is identified by the requestID. If the request identified by 
the requestID is no longer available in a cache of the metadata server 212 then the 

30 processing associated with the request will have to be repeated. The requestID is a 
unique value for the metadata server 212 and is generated by the metadata server 212 (and 
may be based on a timestamp representing the receipt of the request by the metadata 
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server212). The requestID can be returned to the media browser 101 using an element 
having the type, or being derived from the type, MetadataCollection (see Example F). 

When a user or a process arising from the media browser 101 actuates a link to a 
metadata server 212, an Hypertext Transfer Protocol (HTTP) request is sent to the 
5 metadata server 212 that is uniquely identified by the URI with the Request that is 
contained as part of the URI. 

Requests can originate as the result of user performing browsing or searching 
activities using the media browser 101. These requests can all be formulated following the 
Request EBNF production. If no parameters (in the ParameterList) are specified then 
10 the default values will be used. 
Browsing Requests 

In one implementation, a default Request, used when initially gaining browsing 
entry to a metadata collection for the purposes of browsing, may be the 
XPath Expression, "/*" with any desired parameters formatted in a ParameterList 

15 (eg. '7*+maxltems=1 00&noLevels=2"). The corresponding URI would then be: 
http://mySite/myMetadataSvr?/*+maxltems=100noLevels=2&, 
where //mySite/myMetadataSvr was the URI of the metadata server process. 

On receipt of this request, the metadata server 212 invokes a procedure to satisfy the 
request. This procedure results in the generation of an XML description of the associated 

20 metadata collection. This description thus reflects a structure by which the associated 
metadata collection may be browsed. It is common for the metadata collection to be stored 
in a database of some form. For example, the metadata server 212 may be configured to 
provide category or publisher sections for the collection so that users can more readily 
browse the metadata. Typically these categories are reflected in the schema used to 

25 describe the database items. Alternatively, the metadata server 212 may reply to the 
request from the media browser 101 by simply sending a list of all the separate items in the 
database. 

For the purpose of describing a typical scenario of use, consider an image metadata 
database with the following structure. The database is composed of a number of 
30 categories including Lifestyles, Sports and Animals as illustrated in Fig. 7. Whereas the 
Lifestyles category has no further structure (ie. it is composed of solely images), the Sports 
category is structured further into subcategories and the Animals category is structured 
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further into subcategories and then image classes. It is not important how this data is 
actually stored for the purposes of the present description. 

There is no fixed way that a metadata server 212 may implement its translation 
facility for its metadata collection. One possible way is described below. 
5 The metadata server 212 generates descriptions based on an XML schema which 

contains definitions of types for Categories, Subcategories, Classes and Images. These 
definitions use the core attributes of the media browser 101 and the global XLink 
attributes (see Section II above). A basic example of such definitions is provided below in 
Example G of an XML Schema. Note, that the definitions can use the xiink:show 
10 attribute to direct the media browser 101 to embed the target of the link at the source 
(ie. the description fragment generated by the metadata server 212 would be simply 
included as the content of the link source element). Definitions may also set this attribute 
value to replace, in which case the media browser 101 would replace the descriptor, 
which is the link source, with the description fragment served by the metadata server 212. 

15 



20 



Example G: XML Schema Example 


1. 


<?xml version='1 .0'?> 


2. 


<!— XML Schema for SomeSite's metadata server -> 


3. 


<!DOCTYPE 


4. 


schema PUBLIC "-//W3C//DTD XMLSCHEMA 19991 21 6//EN" 




"XMLSchema.dtd" > 


5. 


<schema 


6. 


xmlns = 'http://www.w3.org/1 999/XMLSchema' 


7. 


xmlns:mb = 'http://www.research.canon.com.au/MediaBrowser' 


8. 


xmlns:xlink = 'http://www.w3.org/1999/xlink' 


9. 


targetNamespace = 'http://someSite/MetadataSvr' 


10. 


version = '1.0'> 


11. 


<complexType name = TOCDescriptor* > 


12. 


<attribute ref = 'xlink:type' use = 'fixed' value = 'simple'/> 


13. 


ottribute ref = 'xlink:href/> 


14. 


<attribute ref = 'xlink:role' use = 'default' value = 'description'/> 


15. 


<attribute ref = 'xlink:show' use = 'default' value = 'embed*/> 


16. 


<attributeGroup ref = 'mb:TOCDescriptorAttributes7> 


17. 


</complexType> 
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18. <element name = 'Image' type = TOCDescriptor7> 

19. <element name = 'ImageClass' type = TOCDescriptor7> 

20. <element name = 'Subcategory^ 

21 . <comp!exType base = TOCDescriptor' derivedBy = 'restriction^ 

22. <choice> 

23. <element ref = 'Image' minOccurs = '0' maxOccurs = 'unbounded7> 

24. <element ref = 'ImageClass' minOccurs = '0' 

maxOccurs = 'unboundedV> 

25. </choice> 
10 26. </complexType> 

27. </element> 

28. <element name = 'Category^ 

29. <complexType base = TOCDescriptor' derivedBy = 'restriction^ 

30. <choice> 

15 31. <element ref = 'Image' minOccurs = '0' maxOccurs = 'unbounded'/> 

32. <element ref = 'Subcategory' minOccurs = '0' 

maxOccurs = 'unbounded7> 

33. </choice> 

34. </complexType> 
20 35. </element> 

36. <element name = 'Metadata Server Response' type = 'mb: Metadata Collections 

37. </schema> 

The TOCDescriptor provides a base type which acts as a simple XLink linking 
25 element (ie. xlink:type is fixed as having a value of simple). Links back to the metadata 
server 212 are expressed using the xlink:href attribute (see Section II above). The default 
behaviour for the link is to embed the target (ie. the generated description) as the content 
of the source description. Also the default role of the link has been given as description. 
If a particular link refers to content, then this role is preferably changed to be resource. 
30 The declared elements also use the core visual Identifier attribute. This attribute is 

used by the media browser 101 to provide a visual representation of the content of the 
item. For example, if the item is an image then the visualldentifier attribute value will 
typically contain the URI of a thumbnail of the image. In the case of categories, 
subcategories and classes the visualldentifier attribute value can contain the URI of an 
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icon. If this attribute is not specified then, preferably, the media browser 101 generates the 
visual identifier for the item from a provided textldentifier attribute value, or, in the event 
that this value is also not provided, from the name of the element (in this case Image, 
Class, Subcategory or Category). 
5 On receipt of the "/*" Request, the metadata server 212 generates an XML 

description of the collection as in the XML fragment below of Example H. The 
description is contained in an element declared to be of type 'MetadataCollection' (see 
Example G). Note that the metadata server needs only specify the XPath Expression in 
its returning links. It is the responsibility of the media browser to add the ParameterList 
10 to the URI before despatching the request. 

Example H: Returned XML Description Fragment 



1 . <MetadataServerResponse 

2. requestID = '19999123' 
15 3. noltemsldentified = '3' 

4. startltemReturned = T 

5. noltemsReturned = '3'> 

6. <Category 

7. textldentifier = 'Lifestyles' 

20 8. xlinkrhref = u http://mySite/myMetadataSvr?Category[@textldentifier= 

'Lifestyles']/! mage" 

9. visualldentifier = < http://mySite/Metadata/icons/Lifestyles.gif/> 

10. <Category 

1 1 . textldentifier = 'Sports' 

25 1 2. xlinkihref = u http://mySite/myMetadataSvr?Category[@textldentifier= 

'Sports f ]/Subcategory" 

13. visualldentifier = 'http://mySite/Metadata/icons/Sports.gif/> 

14. <Category 

15. textldentifier = 'Animals' 

30 16. xlink:href="http://mySite/myMetadataSvi^Category[@textldentifier 

='Animals f ]/Subcategory 

17. visualldentifier = 'http://mySite/Metadata/icons/Animals.gif/> 

18. </MetadataServerResponse> 
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In the Example H above description, XPathExpressions are used to identify links 
to each of the images in the Lifestyles category and the subcategories in the Sports and 
Animals categories. These links would be activated when a user selected to expand one of 
the above items when they were visually presented in media browser 101 . In the preceding 
5 and following examples, the XPathExpressions have been specified as relative location 
paths assuming that the context node is the root node of the collection. Alternatively, 
absolute paths can be used. 

If, for example, the visual identifier for the 'Sports' category is selected, then the 
corresponding link would be actuated. The metadata server 212 would respond to this link 
10 by generating and returning a description fragment as now indicated below in Example I. 



Example I: Returned XML Description Fragment 





1. 


<MetadataServerResponse 




2. 


requestID = '19999124' 


15 


3. 


noltemsldentified = '1200' i 




4. 


startltemReturned = '1' 




5. 


noltemsReturned = '1 00'> ! 




6. 


<Subcategory 




7. 


textldentifier = 'Basketball' 


20 


8. 


xlink:href = "http://mySite/myMetadataSvr?Category[@textldentifier= 
'Sports']/Subcategory[@textldentifier='Basketball']/lmage7> 




9. 


Subcategory 




10. 


textldentifier = 'Football' 




11. 


xlink:href = "http://mySite/myMetadataSvr?Category[@textldentifier= 


25 




'Sports •]/Subcategory[@textldentifier= , Football*]/lmage"/> 




12. 


Subcategory 




13. 


textldentifier = 'Hockey 




14. 


xlinkrhref = a http://mySite/myMetadataSvr?Category[@textldentifier= 
'Sports']/Subrategory[@textldentifier='Hockey']/lmage'' /> 


30 


15. 


</MetadataServerResponse> 



It is preferred that the returned description be well-formed. Further, the returned 
description must be able to be parsed by the media browser 101. The media browser's 
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action on receiving the contents of a link depend on the xlink attribute show. Typically, 
this attribute will be set to embed in which case the received description is embedded at 
the source of the link. If the received description used a container element (eg. of type 
MetadataCollection as defined in Example F) then this element is also embedded. 
5 Typically embedded container elements are preferably defined as having a 
descriptorType of 'Other' (see Example A). Alternatively, the show attribute can be set 
to replace in which case the contents of the link will replace the element containing the 
link source. If the xlink:show attribute is not defined for the linking element then the 
default action is embed. 

10 The description of the collection may be further explored by a user selecting one of 

these subcategories. This action would result in the metadata server 212 generating a 
description of the images contained in the selected subcategory. 

Note that the description of Example I, which is dynamically generated by the 
metadata server 212, contains only a single hierarchical level. This can be altered by 

15 specifying the noLevels parameter in the Para'meferList of the URL In some cases a 
Request might require two levels of hierarchical description in order to generate a view 
that requires both the parent and the children elements. For example, if the media browser 
101 was using a two-level view and wished to retrieve descriptions that contained two 
levels of hierarchy, then the media browser 101 would append the "noLevels=2" parameter 

20 to the URL For example, the link: 

http://mySite/myMetadataSvr?Category/Subcategory[Category 
/@textldentifier= 4 Sports , ]+noLevels=2 
would result in the description fragment shown below in Example J. 

The second level is assumed to be the children of the level targeted by the link. 

25 When the value of noLevels is greater than one, the values of the parameters, maxltems 
and startltem will refer to the lowest level of the description. Similarly, the values of any 
returned parameters also refer to the lowest level of the description. 



Example J: Returned XML Description Fragment 



30 1 . <MetadataServerResponse 

2. requestID = '199991 25' 

3. noltemsldentified = '500' 
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4. 


startltemReturned = '1' 




5. 


noltemsReturned = '100'> 




6. 


<Subcategory 




7. 


textldentifier = 'Basketball' 


5 


8. 


xlink:href = a http://mySite/myMetadataSvr?Category[@textldentifier= 

'Sports^Subrategory[@textldentifier = l Basketbair]/lmage n > 




9. 


<lmage 




10. 
11. 


textldentifier = 'ImageV 

x!ink:href = "http://mySite/myMetadataSvr?Category/Subcategory 


10 




/Image [@textldentifier='lmage1 Jl> 




12. 


<lmage 




13. 


textldentifier = 'Image2' 




14. 


xlink:href = H http://mySite/myMetadataSvr?Category/Subcategory 
/Image [@textldentifier=1mage2 , ]7> 


15 


15. 


Etc. 




m. 


</Subcategory> 




m+1. 


<Subcategory 




m+2. 


textldentifier = 'Football' 






ylink'hrpf = " h tt n 7/m vS itp/m vMetad a taSvr^Ca tea orvrfS) text Identifier 


20 




= < Sports']/Subcategory[@textldentifier = l Football , ]/lmage ,, > 




m+x 


Etc. 




n. 


</Subcategory> 




n+1. 


</MetadataServerResponse> 



25 Searching Requests 

Searching requests can originate from either the user specifying a structured query, 
using an advanced search option, or the user specifying an unstructured query, using the 
simple search option. If a structured query is compiled then this query is preferably 
represented using an XPathExpression as described for browsing in the previous section. 
30 Therefore only unstructured queries need to be considered in this section. 

Most metadata collections presently in existence have an unstructured search 
function. In many cases considerable effort has been expended to make this search 
function as optimal, in terms of speed and suitable results, as possible. Consequently it is 
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advantageous to use these search facilities whenever an unstructured query is specified by 
a user. 

Unstructured queries can be passed to the metadata server 212 using the query 
function call defined earlier in this section. This function call is preferably included within 
5 a predicate of a step of a location path. As location paths can contain a predicate for each 
of their location steps, an XPath Expression can contain more than one unstructured 
query expressions. However, most requests based on unstructured queries contain a single 
query expression. For example, the XPathExpression, //image[query("dog OR cat")], 
will select all the image items of the root node that satisfy the query "dog OR cat". 

10 Typically searches can result in a large number of items. The description that is 

returned to the media browser 101 can be limited in the number of items the description 
contains by using the maxltems parameter. After receiving the first set of results, the 
media browser 101 may then request a further set by using the startltem parameter. To do 
this, the media browser 101 includes the request ID that was returned by the metadata 

15 server 212 with the response to the original request. In other words, the returned 
requestID identifies the start of a transaction that can be accessed by later requests. 

The above has a number of implications for the configuration of the metadata 
server 212 as such requires the metadata server 212 to be able to save and access the 
results of previous requests. However, traditional server arrangements cannot maintain 

20 such results of requests in a cache indefinitely. Preferably, if a request arrives referring to 
a previous request, the metadata server 212 attempts to match the requestID to its cached 
request results. If the request is no longer in the cache, it is reprocessed. In an alternative 
arrangement, if a match cannot be achieved, then the metadata server 212 can optionally 
attempt to match the request on a textual similarity basis with other requests before 

25 resorting to reprocessing the request. This approach is helpful in that it can eliminate 
much duplicated processing by the metadata server 212. The size of the cache for a 
metadata server 212 can therefore be implementation dependent. 
IV. The Media Browser Application 

The media browser 101 provides the user with a single user interface for browsing 

30 and searching different metadata collections. An example graphical user interface 400 for 
the media browser 101 is shown in Fig. 4. The media browser interface 400 provides the 
user with the options of either browsing or searching for (particular items of) content via 
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metadata associated with the (items of) content. The media browser 101 can be 
implemented as a stand-alone application (eg. like Word97 manufactured by Microsoft 
Corporation of the USA) or as a service able to be supplied to multiple concurrent users. 
The preferred arrangement implements the media browser 101 as a service. In this mode 
5 each user is required to log in to the service to access their personalised TOC. The service 
aspect of the media browser 101 is discussed further in Section V below. The present 
section is devoted to describing the functionality of the media browser 101. The 
description assumes a media browser service however it should be evident that the 
functionality could equally well be implemented as a stand-alone program. 

10 Typically the media browser 101 is implemented with a set of default media tool 

plug-ins. A user of the media browser 101 can then select, and preferably download via 
the Internet, further media tools to plug-in to their own implementation. Each plug-in has 
a defined set of target media types. The separation of media playing/viewing from 
metadata browsing and searching is an important concept for the media browser 101 as 

15 such allows the application to be adapted to particular users/environments. 

The media browser 101 enables browsing access to the metadata by providing a 
Table of Contents (TOC) which represents the structure of the information landscape a 
user chooses to access. This information landscape can comprise links to local metadata 
and/or links to remote metadata and is typically customised by each user as the user 

20 discovers metadata sites which are relevant to personal interests. A default TOC is 
preferably provided for each new user. 

The underlying information landscape is represented at all levels as a description 
(ie. an XML document). This means that the base structure of the description, which for 
XML is a tree containing nodes and links, is the same whether the user is viewing the entry 

25 point of the TOC or viewing the details of a description of a multimedia item of content 
(eg. a digital video). Since the TOC is a visual representation of the information 
landscape, the user's navigation in the TOC is unchanged for all levels of the TOC. This 
means that the interface 400 operates the same whether a user is browsing metadata at 
different Web sites, different sections of a metadata collection (eg. categories in an image 

30 metadata collection), or within a description of multimedia content (eg. a clip in the digital 
video tape). 
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The TOC is formed by items that are selectable. These items comprise the visual 
representations of TOC descriptors (see Section II for more details on metadata 
representation). The items contain visual identifiers to aid the user in browsing. 
Typically, the visual identifier represents the content in some way. This is especially true 
5 for visual identifiers that correspond to items of multimedia content. Examples of visual 
identifiers include, simple or graphically-designed text, thumbnails of images, animations, 
and short previews of videos. Preferably, these visual identifiers are provided by the 
descriptions but, if not, the media browser 101 can graphically generate them from 
information contained in the description (eg. textldentifier attribute or element name). 

10 Visual identifiers have been discussed in more detail in Sections II and HI. 

The browsing functionality provided in the preferred arrangement may now be 
described with reference to Fig. 5. On activating the media browser 101, the initial 
description of the information landscape is read in step 500. This initial description 
usually contains a set of top-level links to different metadata collections or sections of 

15 metadata collections. The media browser 101 then, in step 501, processes this description 
and constructs an initial TOC from the description. Typically the processing of a 
description involves parsing the XML document that contains the description and 
representing the description using an object model in the computer memory. Preferably, 
step 501 involves detecting all the TOC descriptors from the description and building a 

20 TOC from those descriptors. Preferably, the differentiation between TOC and index 
descriptors is performed using the core descriptorType attribute as described in 
Section II. 

In steps 502 which follows, a view of the initial TOC is generated and presented to 
the user. This view may be provided in the form of a tree structure as used by applications 
25 such as WINDOWS EXPLORER manufactured by Microsoft Corporation. Preferably, a 
rectangular panel 402 is provided as seen in Fig. 4 showing the visual identifiers 404 that 
correspond to the items and the initial level of the information landscape. For example, 
this could be a grid of visual identifiers identifying a number of initial metadata 
collections. 

30 The media browser 101 then awaits a user event. When a user selects an item, for 

example by clicking on a visual identifier 404 in step 503, the item is examined in 
step 504 to determine whether the item has child items in the TOC. This will be the case 
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if a link has been previously followed by the user or an individual description contains 
more than one level of structure (eg. a description of a collection may contain several 
levels in one description). If the item has child items, then control proceeds to step 510 
and the view of the TOC is updated with the child items. 
5 If the selected item has no child items in the TOC, then in the step 505, the media 

browser 101 determines whether the item contains a link to a description. This can be 
achieved explicitly if the linking element, that represents the source of the link, has a 
specified role of description (roles of linking elements are described earlier). If the role 
of the link is undefined, then the media browser 101 decides whether the target is a further 

10 description based on the file extension of the URI of the link. For example, if the 
extension is "xml", then a description will be initially assumed. However, if on parsing 
the xml" file, the file is found not to conform to a specified description scheme, then the 
media browser 101 preferably treats the ".xml" file as a resource rather than a description. 
In the event that the selected item does contain a link to a further description, 

15 step 506 is implemented where the media browser 101 determines whether the specified 
description is available in the description cache (ie. the description has previously been 
fetched, perhaps for another user or a previous session with the present user). If the 
description is not available, then the media browser 101 fetches that description in 
step 507. This can be achieved by forwarding the fetch request onto a standard web 

20 server. The returned description is processed and stored in the description cache in 
step 508. In step 509, the TOC is then updated to reflect the new description using the 
same principles that were used in creating the initial TOC. Finally the view of the TOC is 
also updated in step 510 and presented to the user for further interaction. After step 510, 
control returns to step 503 where a further selection from the TOC may be made. 

25 The browsing event described in the previous paragraphs results preferably in the 

viewing panel being updated to contain the items at the new level of the information 
landscape. For example, this new level may show the major categories of a particular 
metadata collection. 

In the event that, at step 505, the selected item did not contain a link to a further 
30 description, the link is treated as a link to an item of content. The visual identifier of the 
item is highlighted in step 520 and further actions can occur in step 521. For example, the 
identifier could be selected with a number of other items to be dragged to a clipboard 406 
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or a shopping trolley 408 forming part of the interface 400. If a link to an item of content 
is double clicked by the user then the item is immediately presented or played using the 
default media tool for the content type of the selected item. 

A preferred implementation of the media browser 101 allows two types of searches. 
5 A simple search is constructed by the user providing a text query to a search entry box 410 
and select a simple search function 412. The user is also able to construct an advanced 
structured query using a list of the available index descriptors by selecting an advanced 
search 414. The latter option is possible because the media browser 101 has knowledge of 
the schemas used for the different descriptions. Preferably, the media browser 101 can 

10 construct a list of the index descriptors used by the schemas and the user can enter 
constraints for the query by directly manipulating the descriptors used. For example, if a 
user would like to search an image database for images that are published by Publisher 
"ABC" and have a cost in the range $100 to $200, a query is more likely to be successful if 
the user can build a structured query directly from the descriptors available rather than just 

15 use keywords in a textual query. The latter approach, which corresponds to the simple 
search function mentioned above, could result in the strings "ABC", "$100" and "$200" 
being located anywhere in image descriptions. The processing of structured search queries 
is discussed further below. 

The searching functionality of the preferred arrangement of media browser is 

20 described now with reference to Fig. 6. In a first step 600, the user specifies one or more 
context items for the search. These are items in the TOC that are to be searched when the 
search is initiated in step 603. Step 601 determines if the user selects an advanced search. 
If the user does not select to perform an advanced search, control passes to step 602 where 
the user is required to specify a text query as mentioned above. This query may be formed 

25 of a list of keywords or phrases that the user is interested in. 

If the user selects to perform an advanced search then control passes from step 601 
to step 620. A list of available index descriptors are then generated from the schemas that 
are relevant to all the descriptions contained in the list of context items. In a preferred 
implementation, index descriptors are distinguished from TOC descriptors by the 

30 descriptorType attribute as discussed in Section II above. The user in step 621 can then 
formulate a structured query based on the list of available descriptors and a set of basic 
search expression (Boolean) operators (eg. AND, OR, and NOT). The query can also 
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allow the user to express acceptable ranges on particular index descriptors. For example, 
the price of an item must be >$100 and <$200. 

In step 603 the user initiates the search with the current query (textual or structured). 
This is followed by step 604 where the first item in the list of context items is identified. 
5 A new thread or process is created and then started for the context item in step 605. In 
step 606, which follows, a check is made to see whether the identified context item has an 
associated metadata server. If the context item is the origin or root for a particular 
metadata collection, step 606 involves checking the link in the description. If the 
identified context item is not an origin or root item, then it is necessary to examine the 

10 TOC to establish whether a metadata server exists for a parent of the identified item. If 
such a check results in the location of a relevant metadata server for the identified item, 
then the context of the identified item within the metadata collection is included in the 
location path of the XPath Expression which carries the query as a request to the 
metadata server. For example, if a selected context item for a search was "Lifestyle 

15 category" in "Image Collection ABC" then the search request would be passed to the 
metadata server in a URI such as: 

http://www.lmagesABC.com/Me^^ 

'Lifestyles'l/lmagelquery^expression^ 
where <expression> contains the query (textual or structural). 

20 If an associated metadata server is identified in step 606 then the query is formulated 

as a URI (using the request syntax described in Section US) and sent to the identified 
metadata server in step 608. 

If no metadata server is identified then in step 606, a search is commenced for items 
satisfying the request in the identified context item in step 604 and then control passes to 

25 step 609 where any further context items are detected. If further items exist then, in 
step 610, the next item in the list of context items is identified, and control returns to 
step 605. If step 609 finds no more context items remain to be identified, then control 
passes to step 620 where the search process waits for search results to arrive. It will be 
appreciated in this regard that numerous search processes on individual context items may 

30 operate and return results substantially simultaneously. When all threads, or processes, 
complete, the results of the individual search processes are collated in step 625 and the 
process ends in step 630. 
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Users can use the browsing and searching functionality of the media browser 101 to 
locate multimedia content that is of interest. Users can build up temporary collections of 
items by dragging the visual identifiers of items onto clipboards 406 as shown in Fig. 4. 
The clipboards 406 can be optionally saved and recalled in a later session. Clipboards 406 
5 are treated like any other level of the information landscape in that they can be viewed in 
the viewing window and can be selected as a context item for a search. Clipboards 406 
can also be inserted in the information landscape under a "Clipboards" heading on the 
entry TOC. Users can save the contents of clipboards 406 and then retrieve and use the 
saved clipboards 406 at later sessions. 

10 If content is desired immediately and is available for on-line purchase, then the user 

can drag the item to the shopping trolley 408. The shopping trolley 408 is effectively a 
specialised clipboard. Alternative interfaces could simply represent the shopping 
trolley 408 as such. At any time the user can right-click on the shopping trolley to initiate 
a "purchase" plug-in media tool. Alternatively, the user might move the mouse over the 

15 trolley icon to display a menu of available media tool from which the user can make a 
selection. 

The "purchase" plug-in operates in the same fashion as the media tools, already 
described, which provide media viewing and playing capability. The user can then select 
an appropriate "purchase" tool for their implementation. The purchase tool simply 

20 examines each of the items in the shopping trolley 408, establishes whether those items 
can be purchased on-line and, if so, redirects the user to the content provider/distributor's 
site to purchase the item. In an alternative configuration, users can establish accounts with 
a media browser service and to purchase items through these accounts. Media browser 
services are discussed further in Section V. 

25 V. Media Browser Business System 

The media browser 101 described in Section IV can be implemented as a service. In 
a preferred implementation, the media browser 101 is technically implemented as a client 
- server application and operated as a service to which users can login from the Internet. 
Each user is preferably securely identified by a password and can store up to a specified 

30 limit of data with the service. This user data is composed of an initial TOC description, 
user preferences, stored clipboards and other information required for client operation 
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(eg. user preferences, information about locally installed plugins, etc.). Preferably this 
service is provided to the user for a periodic (eg. monthly) subscription fee. 

One of the main technical advantages of operating the media browser 101 as a 
service as described above is that descriptions can be cached. As such, for example, if 
5 company "ABC" installed a media browser service, and a large number of the users at 
company "ABC" used a particular metadata collection, the descriptions from this 
collection would be available in the description cache of the service. In other words, the 
descriptions would not need to be fetched for each individual user. This represents a key 
advantage. 

10 In the preferred implementation, the media browser service operates as a service 

linked to a standard web server. The media browser client can thus be implemented using 
a standard Web browser. This means that users can simply go to the media browser home 
page to start the client on the user's own computer workstation. The server typically 
operates continuously just as a standard web server does in most web sites. 

15 In the preferred implementation model, a default media browser server is operated 

from the site of a primary service provider (eg. the company which owns the right to the 
intellectual property of the technology). Other parties can purchase the rights to install 
their own media browser service on their own intranet. Such an option may be desirable 
for parties wanting to optimise the speed of the service for users of their intranet. 

20 A further advantage of the above disclosure lies in a business system centred around 

the concept of the metadata server 212. As described in Section HI, the metadata 
server 212 provides a means by which a content provider/distributor can make available 
the metadata stored in a legacy system, such as an SQL database. Therefore, the ability of 
a content provider/distributor to have a metadata server 212 serving their metadata 

25 collection effectively opens their customer base as potential customers now can access 
their metadata from potentially many sites. Indeed each media browser client can 
potentially provide access to the content provider/distributor's metadata collection. This 
provides the benefit of increased sales and exposure. 

However, as with all Web sites hoping to introduce the Internet public to their 

30 wares/content, potential customers need to know about the existence of the metadata 
server 212 of the content provider/distributor. To permit this to occur, when a content 
provider/distributor decides to become involved in the media browser/metadata server 
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system 100, the content provider/distributor downloads a sample (customisable) metadata 
server from the primary media browser service provider. With this, the content provider 
can modify the sample metadata server from common platform to one that incorporates a 
specific translator for interfacing the XML Schema format to the database format used by 
5 the corresponding database manager to access the legacy database of the content provider. 
One of the options for the content provider, on configuring the sample metadata server, is 
to select to have their newly "customised" metadata server included as a link on the default 
TOC entry that is distributed with all media browser services. This means that the link to 
the new metadata server would appear on the default service at the primary media browser 

10 service provider and would be distributed with the media browser software to each of the 
secondary services. This provides for direct advertising of the content provider's wares. 
Clearly, users can then customise their own TOC when they begin working with the media 
browser service. However, an initial presence of the link on the entry TOC introduces the 
user to the metadata collection made visible via the newly linked metadata server. 

15 In selecting to have a link to their metadata server included in the standard TOC, the 

content provider/distributor may agree to be billed a certain fee for each quanta of requests 
that their metadata server handles. This fee is typically very small (eg. US$1 for every 
10,000 requests). Preferably the installed metadata server has an integrated billing 
mechanism which is responsible for keeping a tally of the number of requests and then 

20 periodically billing the content provider/distributor for the service. A credit card number 
to be charged may be stored in a secure fashion within the metadata server and billing is 
automatic and electronic. 

In summary, the provision of a metadata server by the primary media browser 
service permits the content provider/distributor to provide an enhanced service and 

25 mechanism for advertising and selling multimedia content. The implementation of the 
metadata server effectively "opens up" the metadata collection of the content 
provider/distributor to a wider audience than that accustomed to simply visiting a search 
engine operated by the content provider/distributor. In addition the metadata 
browser/server system makes it more attractive for potential customers to browse/search 

30 the metadata because customers can perform these actions in a more convenient (ie. single 
interface) and time-efficient (ie. in parallel with other metadata collections) manner. 
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Taking the further step of using media browser services to effectively advertise their 
open metadata collection widens the potential customer base yet again. For this key 
additional advantage, the content provider/distributor undertakes to pay a small regular fee 
based on the number of requests their metadata server handles during a billing period. In 
5 the event that few requests are handled then costs charged to the content 
provider/distributor are small. This is a significant advantage, especially to smaller 
content providers. 

Fig. 10 shows an example implementation in which a local server 150 incorporates a 
media browser 152 as described above and which is available for use by a number of local 

10 users 154 ... 156 connected to the local server 150. The local server 150 provides for 
connection between the users 154 ... 156 and a number of content providers 160 and 170, 
as well as financial establishment 180, via the Internet 102. The providers 160 and 170 
each incorporate a legacy database 164 and a corresponding store 166 of content. 
Typically the database 164 comprises an array of tables that maps references to content to 

15 locations of content within the store 166. A metadata server 162 is also provided and is 
configured to receive media browser requests transmitted as URI's according to HTTP, 
and generate XML descriptions to satisfy the media browser requests. With this 
arrangement, local users 154 ... 156 having access to the media browser 152 are able to 
remotely access the content without having specific knowledge of, or using any call, 

20 commands or instructions unique to or associated with the legacy database 164. With such 
an arrangement, access for the user 154 to the content 166 may occur in a fashion 
transparent to nature of the database 164 (eg. whether the database is formed using SQL or 
dBase) whilst retaining the structural, organisational and searching attributes and functions 
of the database 164. 

25 When performing a search for content across a number of content providers listed in 

a TOC 158, the local user 154 may, for example, be provided with positive returns for 
each of providers 160 and 170. At this stage, the proprietor of the local server 150 may 
invoice each of the providers 160 and 170 for a fee for "introducing" or facilitating access 
of the local user 150 to the content of each provider 160 and 170. Colloquially, this may 

30 be considered a "spotter's fee" and could be charged in a number of ways such as based on 
the number of searches delivering results, or the number of results delivered by any search, 
or simply the number of requests that the metadata server processes. 
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Where the local user 154 is desirous of purchasing content returned by provider 160, 
a financial transaction may be performed between the local user 154 and the provider 160, 
perhaps via the financial institution 180, and without affect on or influence by the local 
server 150. In an alternative approach, the local server 150 may be interposed as a 
5 financial intermediary whereby the provider 160 bill the local server 150 for the purchase 
of the content, and the local server 150 on-bills to the local user 154. Such an approach 
may be more convenient and provide enhanced security for transactions than the previous 
billing and payment arrangement. For example, where content returned by a searching 
session is desired to be purchased from a number of content providers 160 and 170, the 
10 local user need only perform a single transaction with the local server. Since those two 
parties have a pre-existing relationship, user identification may be more relaxed than if the 
user were to purchase directly from the provider, with whom no relationship may exist. 
The same issues apply to the relationship between the local server 150 and the 
providers 160 and 170. 

15 Although the foregoing describes arrangements and implementations applicable 

respect to the provision of multimedia content, other goods and services may also be 
provided. For example, as seen in Fig. 1 where the link 118 is from the database 117 to 
physical goods, as opposed to multimedia content that may be downloaded electronically, 
the capacity of the user to enquire of the database 117, obtain a search result and ultimately 

20 perform a purchasing transaction remains. 

Further, some implementations may have no commercial basis for a specific 
financial transaction. For example, a national patent office may wish to make its 
proprietary database available to the general public. Implementation of the media browser 
and servers described above enables this to be performed without a need for the public to 

25 purchase or otherwise obtain enquiry software associated with the proprietary database or 
for the patent office to regularly translate its entire database into a generic form (eg. 
HTML) suitable for reading by traditional browsing applications. Such would therefore 
permit public access to a live proprietary database. 
VI. Customisation for Devices 

30 Fig. 11 shows a multimedia access system 1100 in which a user has authorised 

access to a media browser server 1102 for the purpose of browsing a communication 
network 1 106 such as the Internet to identify multimedia items of interest to the user and 
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for which reproduction may be desired. The media browser server 1 102 is associated with 
a database 1104 which incorporates table of contents (TOC) data specific to the user and 
incorporating locations previously visited or available to the user for review. With the 
table of contents and the media browser server 1 102, the user is able to extract multimedia 
5 content via a metadata server 1108 having associated therewith a corresponding content 
repository 1110 and metadata repository 1111. 

The user may access the media browser server 1102 by means of a desktop 
computer 1112, substantially corresponding to the arrangement shown in Fig. 9. With 
such an arrangement, the desktop computer 1112 has the capacity to reproduce, depending 

10 upon its configuration, most types of multimedia items including video, audio, and images, 
each provided in possibly a number of formats. 

In such a situation, the table of contents as supplied to the user at the desktop 
computer 1112 may appear as the table of contents 1114 which includes metadata relating 
to video, audio and the image content items. Each of these items are able to be presented 

15 in the table of contents 1114 since the desktop computer 1112 has the capacity to 
reproduce each of those data formats. As a consequence, such may represent the entirety 
of the user's table of contents as stored within the database 1104 of the media browser 
server 1102. 

However, and according to the present arrangement, when the same user operates an 
20 alternative device for media browsing and delivery, the table of contents presented to the 
user on that alternate device is modified to present only those items of content able to be 
reproduced on the presently used (ie. alternate) device. This is also illustrated in Fig. 1 1 
where the user operates a mobile telephone handset device 1116 which is able to be 
connected to the Internet network 1106 via a cellular base station and public switch 
25 telephone network 1118. The mobile telephone handset 1116 may, in this regard, perform 
browsing operations through the media browser server 1102 using an appropriate data 
format, such as the wireless application protocol (WAP). 

In this case however, the mobile telephone handset 1 1 16 is provided with capacity to 
reproduce only text on a display thereof and to reproduce sound by means of the loud 
30 speaker integrally contained therein or using headset device known in the art and 
connectable to the handset 1106. As a consequence, text browsing (using, for example, 
the textual identifiers rather than the visual identifiers in the metadata) may be performed 
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using the telephone handset to display a table of contents 1 120 limited only to audio items 
that are able to be reproduced using the loud speaker of the handset 1116. For example, 
where the handset 1116 incorporates an MP3 player module, audio components within the 
table of contents 1120 that include MP3 compressed data may be reproduced. Further, 
5 audio encoded in other formats, such as 8 or 14-bit PCM may be reproduced band limited 
to the 'telephone" waveband of 300Hz - 3kHz. 

Typically, the content requested by the user is stored at the site of the content 
provider (eg. metadata server 1108). Alternatively, the content may be stored in a secure 
fashion together with the media browser service 1102. With such a configuration, the 
10 media browser server 1 102 can customise the streaming of the content to the device, be it 
the computer 1 1 12 or telephone 1116, depending upon the destination device in use at the 
time. Such customisation may involve modifications of bandwidth, coding and any forms 
of encryption. 

Such an arrangement also permits the user to browse content in the form of previews 

15 prior to that content being purchased. Once satisfied with a preview, the user may then 
select to buy a right to access the content. This right may allow the user a single play/view 
of the digital item, play/view rights for a predetermined period of time, or an unlimited 
copy of the digital contents (CD or electronic). When purchasing the right to use the 
content, the user can also specify the quality of the service (QOS) desired for the 

20 purchased content (eg. the number of channels, associated movie clips, etc). Once a 
purchase is made, the QOS will represent the maximum QOS available to the user 
irrespective of the device being used. This is because if the user logs onto the media 
browser server 1102 from a device that cannot utilise the purchased QOS, a lower QOS 
will result. An example of this is where the user purchases the right for reproduction of 

25 CD quality audio using the desktop computer 1112. At some later time, whilst in 
possession of the telephone handset 1116, the user may select reproduction of the 
previously purchased CD audio which, by virtue of the lower quality output of the 
telephone handset, may be reproduced at a lower quality of service (eg. telephone quality). 
Such a change in the QOS can be detected by the media browser server 1102 depending 

30 upon the device connected at the time, thereby providing for the media browser server 
1102 to extract the appropriate content via the metadata server 1108 at the reproduction 
QOS applicable to the device in use at the time. 
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In each instance of the examples noted above, the TOC provided to the user is one 
derived from the user's data within the repository 1104, but modified by the media 
browser server 1 102 depending on the particular device being used at the time. With such 
an arrangement, the media browser server 1102 has the capacity to detect the type of 
5 device connected to the server 1102 at any time and ensure delivery of content to that 
device in an appropriate format without delivery of superfluous information. Such an 
arrangement is desirable through automatically limiting user selections to content that is 
able to be reproduced, and also in the reduction of bandwidth being consumed by data 
transfers across the system 1 100. 

10 VII. Controlling Right To Use 

One problem associated with the provision of electronic multimedia content to users 
is the extent to which the user may reproduce and/or copy that content for private use 
and/or distribution. This is particularly important in the case of the on-line sale of audio 
and video content and in the maintaining of copyright and the provision of royalties to 

15 artists and performers. Specifically, this problem is realised once a user has browsed the 
content available and has purchased a content selection. Typically, the purchasing 
provides a right to use the content with, in most cases, the purchase price being related to 
the manner in which the content may be used. Typically, content may be provided with a 
right to use that varies from a single play or use of the content, a play for a predetermined 

20 period of time (eg. one hour, one week, one year), indefinite usage by the purchaser (user) 
and, in some instances, the right to distribute the content either inhibited or uninhibited by 
further rights to use. 

An arrangement 1200 providing for such control over the right to use is illustrated in 
Fig. 12 where a user's device 1202 accesses multimedia content Via a media browser 

25 service 1206, the content being obtained from a content provider 1212 incorporating a 
metadata server 1214, a content repository 1216, and a metadata repository 1218 related to 
the content. As with the previous arrangements, the user of the device 1202 has 
metadata 1208 associated with the media browser service 1206 and which form the links 
and control that enable the content to be retrieved from the content provider 1212 and 

30 delivered to the user's device 1202. 

At the time of purchase, a rich link to metadata from the repository 1218 relating to 
the content 1216 is forwarded to the user. This rich link, which can also be described as 
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metadata, includes a link to the content, the metadata also including information regarding 
the content 1216 (eg. the listing of the program material) and is used for searching the 
content, access and usage information giving the user the right to use the content. The 
access information is required for the user to enable the digital content to be 
5 delivered/streamed to the user as and when required. In this fashion, the user does not 
store the content but, whenever the content is required to be reproduced, the user merely 
accesses the content via the media browser service 1206. As with the previous 
arrangement, the content may alternatively be stored with the media browser service 1206 
in which case the service 1206 acts as a trusted service for the content provider 1212. 

10 The arrangement 1200 may be operated using either one of the unencrypted or 

encrypted content being delivered to the user's device 1202. In either case, after purchase, 
a request 1226 to either view or play the content 1216 is issued from the user's 
device 1212 and received by the media browser service 1206. 

In the case of unencrypted delivery, an access key 1230 is sent 1222 from the 

15 media browser service 1206 to the metadata server 1214 in response to the request 1226. 
The metadata server 1214 validates the access key 1230 before delivering/streaming 1220 
the content 1216. The content 1216 is preferably delivered to the media browser 
service 1206 and from there to the user 1202 in a device sensitive manner 1228, which in 
this case is not encrypted. The access key 1230 is preferably formed of two parts. The 

20 first part comprises a key part 1210 which is stored with the user's metadata 1208 as part 
of the media browser service 1206 and operates to identify the user and the item of content 
that the user has a right to access. The key part 1210 may be a key value entered by the 
user at the time of purchase of the content 1216 and represents an equivalent to a personal 
identification number enabling access to that content, and may relate to the specific 

25 purchase of content and thus incorporates information regarding the user and the content 
being purchased. Alternatively, the key part 1210 may be automatically generated by the 
metadata server 1214. Another part 1224 of the access key 1230 may be unique to the 
media browser service 1206. As a consequence, the access key 1230 is formed by an 
authenticated two-part key pair arrangement which, upon receipt by the metadata server 

30 1214, provides for the delivery of the content 1216 to the user's device 1202 whereupon it 
may be reproduced using a player 1204 forming part of the device 1202. 
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It is to be noted that this (unencrypted) method described only ensures that the user 
is a valid receiver of the content purchased. Once the metadata server 1214 delivers the 
content, the metadata server 1214 has no control over the user storing the content or the 
content being intercepted by other potential users. 
5 In the case of encrypted content being sent to the user, when the metadata 

server 1214 receives a valid access key 1230, the metadata server 1214 responds by 
sending the content 1216 to the media browser service 1206. This delivery need not be 
encrypted since, in most implementations, the link between the media browser 
service 1206 and the metadata server 1214 may be a permanent or otherwise trusted 

10 connection. Encryption however may be applied for additional protection. The media 
browser service 1206 then encrypts the content 1216 using key information that identifies 
the user's current session and sends the encrypted content to the user's device. In this 
regard, the key information may be made of or generated from an identification (ID) 
provided by the user's client device during the request 1226. 

15 The player 1204 at the user's device 1202 then conditionally decrypts the content if 

it has a valid decryption key. This key may be the key information used to encrypt the 
content. Alternatively it may be a private key of a public/private key encryption pair. The 
session ID can form part or whole of a decryption key for the content. The requested time 
of streaming from the server 1206 may additionally or alternatively be used. 

20 This (encrypted) method of content delivery provides protection at three separate 

levels. Firstly, at the metadata server 1214, content is only delivered to the media browser 
server 1206 if a valid access key is received. Preferably, the media browser server 1206 
checks whether a request is valid with respect to the set time in the right of use conditions. 
That is, the media browser server 1206 checks on expiry date before sending the access 

25 key 1230 to the metadata server 1214 of the content provider 1212. The media browser 
service can do this by checking the link to usage information. This requires the usage 
information to be structured according to a predetermined schema. Alternatively, and 
preferably, the media browser may just send the request and the metadata server will check 
the usage rights. 

30 Secondly, the encrypted content cannot be intercepted between the media browser 

server 1206 and the user's device 1202 because it will not play for another session or user. 
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Thirdly, the content cannot be stored at the client because the player 1204 
conditionally decrypts the content. 

Alternatively, the key information used to encrypt the content may include other 
information such as the access key and expiry date obtained from the right to use metadata, 
5 or by a biometric information that can be checked by the player 1204. For example, the 
biometric information may include a fingerprint, or a voice key identification, to name but 
a few. 

VIII. Communicating Links Between Users 

It is often desirable for different users to exchange links to content they find 
10 interesting. 

Fig. 13 shows an arrangement 1300 where a media browser service 1302 is 
associated with a metadata server 1304 for the provision of content 1306 to users. The 
service 1302 incorporates a repository 1304 of tables of contents for each of a number of 
users. A number of users 1310 and 1312 are registered with and coupled to the service 

15 1302 by means of a communications network 1380, which is typically a public switched 
telephone network (PSTN) which may incorporate radio frequency components, such as 
cellular and microwave links in addition to wired landlines. 

The first user 1310 has a table of contents 1360 stored within the repository 1308 
and similarly the second user 1312 has a table of contents 1362. The second user table of 

20 contents 1308 incorporates metadata item 1314. The table of contents 1362 of the first 
user 1310 includes a metadata item 1318. Upon logon to the service 1302 by the first user 
1310, the TOC 1360 is reproduced 1314 at the user device and is seen to include an image 
thumbnail corresponding to the item 1318. The metadata item 1318 provides for content 
1330 to be delivered to the first user 1310, with the content 1330 as received being 

25 associated with the one year use period as seen at 1320. 

Where the first user 1310 desires to share the content 1330 with the second user 
1312, with knowledge of logon details of the second user 1312 with the service 1302, the 
first user can transfer the metadata from his TOC 1314/1360 to the TOC 1362. This is 
seen in Fig. 13 A by a transfer 1328 of the metadata 1318 within the repository 1308 from 

30 the TOC 1360 to the TOC 1362 to provide metadata 1322. However, since the second 
user 1312 has not purchased the content 1330, the right-to-use metadata associated with 
the transfer is altered from "one year's use" to "one play/view" thereby permitting the 
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second user 1312 only a single reproduction of the content 1330. At subsequent logon by 
the second user 1312, the TOC 1362 is loaded to the user's device and appears as a TOC 
1316 where the metadata 1322 is presented in an Inbox of the TOC 1316. 

Fig. 13B shows an arrangement 1390 similar to that of Fig. 13A and where like 
5 reference numerals relate to corresponding devices having corresponding functions. The 
arrangement 1390 of Fig. 13B however provides for the wireless communication of the 
metadata 1318 from the first user 1310 to the second user 1312. In this configuration, 
each of the user devices are provided with transmitters 1340, 1344 and complementary 
listeners (receivers) 1342, 1346, as illustrated which provide for bi-directional direct 

10 communication between the user devices, using wireless communications such as RF or 
IR. With such an arrangement, the first user 1310 may transfer the metadata 1348 from 
the TOC 1314 via the transmitter 1340 for reception by the listener 1346 which can then 
attend to the conveyance of the metadata to the TOC 1316. From there, the network 
connection 1380 enables the second user 1312 to update the TOC 1362 with the new item 

15 entry 1322. The specific advantage of this approach is that the sender of the information, 
does not need to know the logon details of the recipient. 

A further implementation may be obtained through wired or wireless communication 
of metadata to a person who is not a registered user of the media browser service 1302. In 
such an instance, the metadata may be wrapped in a voucher which contains the URI of the 

20 sender's 1310 media browser service 1302, together with an anonymous user login for the 
service 1302. The recipient of the voucher may then select the voucher (eg. via a mouse 
click) to access the media browser service 1302 and view the metadata via the anonymous 
login. The recipient can then play the content, dependent upon the right-to-use 
information in the metadata being sufficiently permissive. 

25 The voucher may be sent by e-mail. The voucher may also be sent in a wireless 

fashion if a transmitter/listener environment exists between the two devices. The received 
voucher would then appear on the recipient device's desktop. 

In a situation where the metadata 1326 is conveyed using any one of the above 
described methods, a number of possibilities remain where the metadata includes the right 

30 to use information. Firstly, the right to use may be left unchanged and as such may be a 
default case for content that is provided free of charge. Such an arrangement is unlikely to 
be supported by content providers who charge a fee for their service. A further possibility 
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is that the right to use defaults to a single use 1328 as illustrated in Fig. 13. A further 
alternative is that the right to use is altered in some way dictated by the rights to use 
information of the original metadata. This may involve communication with the metadata 
server of the content provider and new rights to use being transmitted to the new user. 
5 This process could automatically be initiated by the receiving media browser client. In this 
fashion, the transfer transaction may only be performed with the knowledge of the media 
browser and/or metadata server who then have the capacity to modify the table of contents 
of the new user and as a consequence the right to use information. 

It is preferable to communicate just the metadata 1326 rather than the content (not 

10 illustrated in Fig. 13) for a number of reasons. Firstly, mobile devices usually have larger 
receive bandwidths than transmit bandwidths (because such does not require an expensive 
transmitter) and thus the device supplying the link is not burdened by transmitting content. 
Also, the metadata can include information about the right to use the content and therefore 
the arrangement is attractive to content providers who desire to limit unlicensed use of the 

15 content. This is seen in Fig. 13 A where the first user 1316 has a one year use 1318 of the 
content but, upon providing the metadata links 1326 to the second user 1320, the second 
user is provided with a "one play only" right to use 1328. Further, an advantage of 
transmitting only the metadata is that the metadata can be added to the table of contents of 
the receiving user 1320 and therefore can be used for searching. Such metadata can also 

20 enable that receiving user 1320 to purchase the content and to obtain full rights to use as 
required. 

Preferably the metadata in each of the described cases is a link to either an individual 
description or to a metadata item in the metadata repository associated with the metadata 
server 1304. The communicated link can either contain the rights to use information or 

25 contain a reference or link to this information. The relationship between links, metadata 
and selectable content is shown in Fig. 13C. Since links can also contain attributes, can be 
loosely described also as metadata. 

In some instances it may be preferable to stream the content rather than the metadata. 
The advantage of streaming the content is that the stream content will be able to be 

30 received effectively at the same time (synchronised for singing along (kararoke) etc). 
IX. Switching Sessions Between Devices 
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Fig. 14 shows an arrangement 1400 that enables the user to switch a current media 
browser session from one device to another. Shown is a media browser service 1402 
configured to provide multimedia data streaming to a user operating, for example, a 
mobile telephone handset 1404 or hi-fi audio equipment 1412 within a motor vehicle 
5 1416, each of which are provided with appropriate players for reproduction of the 
multimedia stream. 

In an exemplary operation, the user in possession of the mobile telephone 1404 may 
request streaming media from a provider accessible via the media browser service 1402. 
Such may involve payment or may be free of charge. Upon making the request, the 

10 telephone may also transmit to the media browser service 1402 a public key to enable the 
provider for use in authentication. The provider can then send to the telephone 1404, via 
the media browser service 1402 if such are not one and the same, metadata relating to the 
session. This includes a session identifier and may include a key that is used to unlock the 
media. The key is encoded with the public key of the telephone 1404 so that only the 

15 specific destination telephone 1404 can decode the media using a complementary private 
key stored within the telephone 1404. The private key is typically secured within memory 
of the telephone 1404 and is not intended for export from that device, thereby preventing 
some other device masquerading as the telephone 1404. 

After the user has received the media using the telephone 1404 for some time, the 

20 user then decides on diverting the session (playback) to the equipment 1412 within his /her 
motor vehicle 1416. To achieve a diversion of the session, the user aims an infra-red (IR) 
transmitter/receiver (not illustrated in Fig. 14, but known for use with hand portable 
electronic devices) incorporated in the telephone 1404 at a corresponding IR 
transmitter/receiver 1414 associated with the car equipment 1412, and through depression 

25 of a transmit button 1408, the telephone 1404 transmits the metadata associated with the 
session to the car equipment 1412. 

The car equipment 1412 then operates to renegotiate the session with the media 
browser service 1402, performing the same step that the telephone 1404 performed to 
commence the session. This may involve negotiating extra payment, for example where 

30 the quality of serviceable to be reproduce by the equipment 1412 is greater than that of the 
telephone 1404 and such is desired by the user. A specific handover time is also 
negotiated, thereby permitting seamless transfer between the players. 
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When the handover time is reached, the media browser service 1402 stops sending 
session media to the telephone 1404 and commences sending the media to the car 
equipment 1412. The metadata used by the telephone 1404 is now invalid and no more 
media can be obtained using that particular metadata. The switchover may involve 
5 obtaining estimates of the path delays of the telephone 1404 and the car equipment 1412. 
Methods of estimating path delay in network connections between two participants are 
known per se in the networking arts. 

A specific advantage of this arrangement is that, via the notification to the media 
browser service 1402, the nature of the content stream may be altered conditional upon the 

10 destination device upon which reproduction is to be performed. This may be performed 
subject to the quality of service purchased when the original sessions was entered. In an 
alternative, the change of destination device may prompt the user to improve the quality of 
service by accepting new terms of delivery and any costs associated therewith. In this 
fashion, the delivery of MP3 audio to the telephone handset 1404 may be replaced by the 

15 streaming of CD quality audio to the car hi-fi equipment 1412. Further, having alighted by 
the motor vehicle 1416, the user may choose the transfer the session to a more advanced 
device, such as the desktop computer 1112 of Fig. 11 in which case, video may be 
streamed together with the audio where the content being supplied at the time includes a 
video component (eg. a music video which may be "listened to" on the telephone or in the 

20 car radio or viewed (watched and listened to) via the desktop computer 1112). 
X. Usage Information 

With the various arrangements described above, such as for example Fig. 12, it will 
be appreciated the content provider 1212 can readily accumulate information about the 
number of times particular content items have been requested from their metadata 

25 server 1214 via the media browser server 1206, or any other such server (not illustrated). 
However, extra usage information may be provided by the media browser service 1206 
and such may include: 

(a) the type of devices that are being used to play/view particular items of 
content; and 

30 (b) the usage of particular content items according to demographic and/or 

geographic indications at the media browser server 1206. 
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The content provider 1212 will only have total usage statistics from their own 
metadata server 1214. There may be commercial value in obtaining usage information 
from individual media browser services. This information could be used to influence how 
the content provider 1212 advertises their metadata server (ie. whether to pay to have their 
5 link included in a default TOC of a particular service). 

Although web servers can effectively already provide these statistics for 
downloads, having this information for streaming content gives more accurate statistics 
because it measures "uses" rather than downloads. When a user downloads some digital 
content, no information is obtained about the number of times the content is actually used 
10 since the content is stored by the user. With the described arrangements, the content is not 
stored by the user but streamed or otherwise supplied on each occasion an access is made 
via the media browser service 1206. Also, obtaining statistics on the usage per device type 
can assist content providers target their support/research into playing platforms. 

XI. Providing Suitability Rating Control 

15 With the various arrangements described, an allowable rating may be associated with 

a user account with the media browser service 1206, or a particular device. The former 
requires that there be a range of media browser accounts. Such may be implemented for 
example by a parent (controller) and child (controlled) accounts. The ratings control could 
be used for: 

20 (a) controlling playing/viewing; and/or 

(b) controlling buying. 

Although users can currently place rating controls on people using the web (ie. child 
pages, etc) these rating levels are generally decided globally. The concept of parent/child 
accounts and allowing parent accounts to specifically control the content able to be 
25 played/viewed/purchased using one or more other accounts that are designated as being 
under the parent's control is a highly desirable and customisable method of controlling 
access. Importantly, the parent is able to control the access of just their own children. 

Access may also be controlled by the reproduction device. In this case the parent 
and child could have their owner user accounts or use the same account. However, the 
30 device used by the child may be used to limit the access to some of the available items. 

XII. Locating Media Browser Services 
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A content provider 1212 may wish to be able to identify, possibly all, media browser 
services (such as 1206) available over the computer network (Internet, Web, etc). Such 
may be desired so that the content provider may advertise their content to these services. 
The content provider 1212 may therefore conduct a search of the network to identify 
5 servers that offer the particular (media browser) service and to which the advertising 
material may be distributed. 
XIII. User Interface Navigation 

The user interface described above with reference to Fig. 4 is, like most graphical 
user interfaces (GUI's), a device which seeks to maximise functionality through optimal 
10 presentation of graphical information, some of which is selectable. Such a GUI is 
expected to be used by semi-professional/business users such as graphic designers, 
marketing persons, and the like, as well as by domestic computer users. As with all GUI's, 
display real estate is expensive and it is always desirable to optimise information 
presentation. Database navigation is an important component of most GUI's and such 
15 provides a mechanism by which the user may traverse ordered sets of linked information. 
Traditionally, GUI navigation is performed using a tree-representation of the database by 
which the user selects certain branches of the tree so as to locate desired data. Such a 
presentation, which occurs for example in WINDOWS EXPLORER (trade mark of 
Microsoft Corporation of USA), occupies substantial display real estate, typically along an 
20 entire side of the display screen. Significantly, where many branching levels are 
encountered, such trees extend across the width of the display screen occupying further 
display real estate that may be desired for other purposes. 

An alternative GUI which may be used with the media browser arrangements 
described herein as well as in other arrangements, is an interface 1600 shown in Fig. 16. 
25 When a user logs onto the media browser service, the user is presented with his/her current 
TOC. The top-level items appear in a browse window represented by their visual 
identifiers. 

Navigation through a user's information landscape, or TOC, can be achieved by the 
user interacting with a hierarchical "breadcrumb", which is formed from locations 1602 
30 and 1604 both located above the viewing window 1606. The term "location" is used to 
refer to non-leaf node TOC descriptors. Within each level of the breadcrumb a user can 
select to pull down a menu of other location options at that level of the breadcrumb. 
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Navigation using the hierarchical breadcrumb is exemplified in Figs. 17A and 17B. 
Fig. 17A shows a portion of the GUI 1600 in which the user's TOC is found under a 
selectable tab 1702 entitled MyDocuments. When selected that tab lists the directly linked 
subdirectories thereunder. Selecting a subdirectory by clicking a mouse for example 
5 creates an adjacent list 1704 for the selected subdirectory. From Fig. 17A it will be 
apparent that the user has selected My! mages. Similar selections create new lists in order 
to reveal two JPEG images taken during a holiday in Cairns in 2000. In Fig. 17A it will be 
apparent that the ticks (3) next to each subdirectory name indicate the selected path into 
the database thereby providing the user with an appropriate contextual reference for the 

10 traversal of the database. Where, at any time, the user desires to follow an alternative path 
into the database, any unselected sub-directory may be selected thereby revealing the 
corresponding breadcrumb display. The breadcrumb lists in this regard display only those 
directories or folders subordinate to the selected folder (menu). A display window 1710 is 
arranged beneath the lists to display a representation (eg. a thumbnail image) of any 

15 content items 1712, as distinct from sub-directories or folders within or subordinate to the 
selected folder/sub-directory. Fig. 17B shows the result of where the user, from the 
configuration of Fig. 17 A, has selected MyVideo and the navigation display has altered to 
show those sub-directories having video content. Note that in each of Fig. 17A and 
Fig. 17B, the subdirectory Hoi. 2000 is indicated since that directory contains both (static) 

20 images and video. As before, further sub-directories 1714 and 1716 may be listed upon 
appropriate selection of items from displayed sub-directories. Fig. 1 7B shows the display 
window having content items 1730 which form part of the MyVideo sub-directory. 

One of the advantages of this navigation approach over a standard tree-based method 
is that it is simple to exit from one sub-branch and directly enter another without having to 

25 navigate up and down along the various branch paths. A user can also navigate through 
his/her TOC using the standard method of simply double clicking on visual identifiers in 
the viewing window 1606 in order to display child items of the clicked item. 
Alternatively, previously visited locations can be re-displayed by selecting the desired 
location in a history list 1608 located to the right of the breadcrumb on the interface in 

30 Fig. 16. 

The user can define the number of items to display in the viewing window. Often it 
is desirable to be able to visually present as many items as possible (eg. search results). 
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However if too many items are displayed the content of the items becomes difficult to 
understand. If a location contains more items than can be viewed in a single window 1606 
then a viewing window control 1610 below this window can be used to page through the 
content. Paging was selected in preference to scrolling because trial users expressed this 
5 preference in early useability studies. 

The panel on the top left of Fig. 16 is the search panel 1612. Users can select a 
location from their TOC to be searched. Both simple text-based searches and advanced 
searches are accommodated to allow the user to select the desired descriptors and their 
required values. In an advanced search, a user can construct a query by specifying 

10 constraints on selected descriptors. In other words, the resulting query is similar to a 
browsing expression with a filter and can be expressed directly as an XPath location. 
Although the interface 1600 allows the user to filter search expressions based on content 
types (eg. image, video and/or audio), the media browser server can represent metadata for 
any media type. A media type selector 1614 is included in the interface to improve the 

15 useability of the interface for the target users. 

A panel 1616 on the bottom left displays a set of commonly used properties (index 
descriptors) for the currently selected item in the viewing window. The properties viewed 
are predetermined. If the selected descriptor does not contain index descriptors with these 
descriptor names, and these properties can not be identified by examining the available 

20 index descriptors (detecting, for example, other index descriptors which might also have a 
type date or a similar descriptor name), then no values are displayed. 

The panel 1618 on the right hand side of Fig. 16 contains a list of all the current 
stacks that are open in the interface. Stacks are a user's personalised stores of links to 
metadata (ie. each item in a stack is just a URI with an optional XPointer). Stacks can be 

25 used to store search results, references to favourite images from a set of image libraries or 
a current work set for another task. Items can be dragged from the viewing window onto 
any of the open stacks. Stacks are treated like locations and can be opened and viewed in 
the viewing window. When an opened stack is closed the previous non-stack location is 
replaced in the viewing window. If a user is finished with a stack they can choose to save 

30 the stack for use in a later session. This results in the stack data being saved to the user's 
personal data. 
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A shopping trolley icon 1620 at the bottom right of the interface 1600 is just a 
specialised stack for items for purchase. A "purchase wizard" may be invoked from this 
icon to facilitate shopping and payment where required. The wizard preferably is able to 
purchase the content associated with the metadata in the shopping trolley using the user's 
5 account with the media browser. In other words, if the content was owned by different 
content providers, it would not be necessary for the user to have to visit each vendor to 
purchase those items provided by that vendor. 
XIV. Extending the Information Landscape 

In the arrangements described above, allowance is provided for the user to search for 
10 metadata in locations which are part of the user's information landscape. The landscape is 
typically provided to the user upon initial subscription to the media browser service. 
Although the user can add to this information landscape, the user may not necessarily be 
aware of other new and possibly relevant locations that may become available over the 
course of time. 

15 Feedback about the usefulness and/or relevance of an item is important if it is 

desired to attempt to introduce new material to a user in a predictable and useful manner. 
The above described browsing arrangements have an advantage over standard web 
browsers in they have a good positive feedback indicator (ie. a reliable measure of whether 
an item is useful or of interest to a user). This measure is a user's action to store a link to 

20 the item in a stack 1618 (see Fig. 16). The stacks 1618 are stored and managed within the 
media browser server. Because the media browser application operates as a service, the 
server from which the application operates can register every "move to a stack" action for 
all its registered users. This approach is not possible for a traditional web server because 
the action of storing a link in a user's bookmarks is handled by the browser locally on the 

25 user's workstation. 

The media browser server can thus retain statistics on those URI's that are 
considered useful to its users/subscribers, and can then use those statistical results to 
introduce new links to a user. The results can relate to all locations "stacked" by all users, 
and a ranking of the links is also possible. The user, on sampling the new links, can then 

30 choose to add the new links to his/her own information landscape for future use. 

A system of this kind may be implemented through the incorporation into the 
media browser graphical user interface, such as those shown in Figs. 4 and 16 of an 



535245.doc 



-57- 



altemate searching option. An example of this is shown in Fig. 18 where the GUI 400 of 
Fig. 4 is modified, although such modifications may be similarly and readily made to the 
GUI 1600 of Fig. 16. As seen the GUI 400 of Fig. 18 includes a designator 1800 entitled 
"LOCATIONS TO SEARCH", which, in a fashion akin to Figs. 4 and 16, includes an 
5 option 1802 designated "ALL MY LOCATIONS" representing all of the locations in the 
user's personal information landscape. In Fig. 18, a further option 1804 entitled "ANY 
LOCATION" is included. When a user selects to search using "ANY LOCATION" 1804, 
the media browser server may respond with one or more of the following options: 

(1) to use a simple rank to create a search list from the stacked locations of 
10 all its registered subscribed users, and then order the corresponding 

results; 

(2) to only select the first, say, "n" locations of the rank of (1) and search 
those locations; and 

(3) to use the rank list as in (2) but randomly mutate the ranked locations 
15 with locations lower in the rank list. 

Experiments have indicated that Option (1) does not scale well and results in a lot 
of unnecessary searching because users typically do not look at the last ranked results 
when significant numbers of results are returned. Also users are typically overwhelmed by 
large result lists. Option (2) alone has the disadvantage that new sites have trouble 

20 becoming known because such would typically be low in the ranking. Option (3) 
alleviates the problem encountered by Option (2) by randomly introducing the lower 
ranked locations into the top "n" locations to be searched for the user. In this regard, "m" 
lower ranked locations may be inserted into the search set, with "m" being (usually 
substantially) less than "n". A potential problem with Option (3) is that, due its the 

25 randomness, any particular search is unlikely to be repeated identically without a special 
option to do so. The frequency at which the random mutations are performed in Option 
(3) may be determined by an "explore parameter" that may be tuned or varied by the user. 
The tuning or variation therefore indicates how much the user wants to explore the less 
used locations. 

30 A preferred implementation of the mutation process may be expressed in pseudo- 

code as follows: 
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<mutate> 

determine how many rank positions are to be mutated based on the "explore 

parameter"; 
for each rank position: 
5 select a rank position (1-n), and; 

select a lower-ranked location taking care not to repeat existing locations in 
the search list; 

<end>. 



10 The process described above may be varied in a number of ways. Firstly, the server 

can perform ranking based on media type. For example a separate ranked list of URIs may 
be provided for each of images, video and audio. Also, ranking lists can be augmented by 
ranking lists generated or used by of other media browser servers. Spiders, and other 
known tools for web searching may be used to detect other media browser servers. 

15 The entities stored and manipulated by the stacks are network identifiers such as 

URTs (ie. a URL + possible Xpointer). Such are therefore rankable, and the granularity of 
the ranking is arbitrary. On one hand fine granularity of ranking is good for fine tuning of 
repositories where the quality of the metadata/content is variable. Such may be the case, 
for example, where one catalogue is good, and another is poor and relatively unused. 

20 However the overhead of fine granularity is high and some form of generalisation may be 
needed to obtain useful results. The preferred approach is for the entity being ranked to be 
a URI for either an XML description or a metadata server root. 

Every time a media browser server receives a request to copy a link to a stack, the 
media browser server simply increments the usage count of the corresponding rankable 

25 URI in a rank file. Whenever a user selects to search "ALL LOCATIONS", then the 
server selects the most highly ranked locations, performs the necessary mutation in the 
ranked location list, and then performs the search. 

In a specific implementation, access to the ranked list derived from the stacks may 
be limited to those subscribing user's who pay a premium subscription rate entitling access 

30 to such a facility. 

As an extension of the above, known web browsing applications may be similarly 
modified so that when a user creates a bookmark to a particular web location, a message 
may be automatically sent to a server site, the message including details of the 
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bookmarked location. With such an arrangement, the server site may create its own 
database of locations visited by users of the Web, thereby enabling a ranking of those sites. 
The ranking and list may then be used by search engines to offer sites of perceived quality 
in preference to sites that are less visited, and therefore of lower perceived quality or 
5 usefulness. 

Similarly, for each of the media browser and web browser implementations, a user's 
access to the bookmarked/stacked locations may similarly be monitored and 
communicated to the respective server whereby the access may be accumulated as a 
frequency count against the particular location which can then be interpreted to adjust the 
10 ranking of the location with the respective list, for example, in the case of the media 
browser application, each time a user chooses to view content associated with a link in a 
stack, this action may can also be construed as a positive reference for the item and thus 
may be monitored for ranking in a similar way to that described when a user adds a link to 
a stack. 

15 XV. Maintaining the Currency of a User's Information Landscape 

When a link is formed by a user's bookmarks in a traditional web browser (eg. 
'Netscape Navigator' by Netscape Corp. or 'Internet Explorer' by Microsoft Corp), there is 
no automated way for those links to be checked for integrity. Integrity may only be 
assessed in such arrangements by the user trying to follow a link, and then receiving an 
20 error in reply. 

The media browser can improve on this situation because the server has knowledge 
of the links, by virtue of the links being stored in the metadata database (eg. 107). 
Accordingly, the media browser server can initiate a process that checks any such link 
integrity. If a link is no longer valid, then the media browser may remove the link, but 
25 preferably highlights the erroneous link in some manner. For example, the visual 
identifier corresponding to the link may be enclosed by a red border. The media browser 
can check the integrity of both links to other descriptions/metadata servers and links to 
content, the latter involving accessing individual descriptors in a description. 

The integrity checking performed by the media browser server may be performed as 
30 a background task, not associated with any particular user request. 

Industrial Applicability 
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The arrangements and implementations described above are applicable to the 
computer and data processing industries and particularly those providing multimedia 
services. The implementations specifically provide for Internet service providers to add 
commercial value to the searching services they provide and/or host whilst facilitating the 
5 matching of vendors and purchasers of content. 

The foregoing describes at least one embodiment of the present invention, and 
modifications and/or changes can be made thereto without departing from the scope and 
spirit of the invention, the embodiment(s) being illustrative and not restrictive. 

(Australia Only) In the context of this specification, the word "comprising" means 
10 "including principally but not necessarily solely" or "having'* or "including" and not 
"consisting only of. Variations of the word comprising, such as "comprise" and 
"comprises" have corresponding meanings. 



535245.doc 



* 



-61 - 



Appendix 1 

<?xml version= , 1.0 , ?> 

<!- (C) Copyright Canon Information Systems Research Australia (CISRA) 2000 -> 
5 <!~ All rights reserved -> 

<!- Source description for transformation example in Fig 15 — > 

<VideoDocument 
10 DC.Title = 'Survival of the Platypus 1 
DC.Creator = 'John Smith 1 
DC.Subject = 'Australian Wildlife' 
DC.Type = 'Digital Video' 

href = 'httpV/wwwxisraxom.au/MediaBrowser/examples/content/AusWildSSS.mpg^ 
15 <Scene 

DC.Identifier = 'Scene 1" 
DC. Description = Taronga Park Zoo' 
DC.Coverage.T.Start = '2:05.00' 
DC.Coverage.T.End = '5:10.25'> 
20 <Shot 

DC.Identifier = 'Shot V 
DC. Description = 'Mature male platypus' 
DC.Coverage.T.Start = '2:05.00' 
DC.Coverage.T.End = '2:55.20' 
25 keyFrame = 'http^/NAAAw.cisra.com.au/MediaBrowser/examples/content 

/AusWild883J<F1.jpg'> 

<Frame 

DC.Identifier = 'Frame V 
DC. Description = 'Key Frame7> 
30 <Frame 

DC.Identifier = 'Frame 2' 
DC. Description = 'Description of Frame 27> 
<Frame 

DC.Identifier = 'Frame 3' 
35 DC. Description = 'Description of Frame 37> 

</Shot> 
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<Shot 

DC. Identifier = 'Shot 2' 
DC. Description = 'Zoo habitat' 
5 DC.Coverage.T.Start = '2:55.24' 

DC.Coverage.T.End = '4:02.10' 
keyFrame = 

'http://vvww.cisra.com.au/MediaBrowser/examples/content/AusWild883_KF2.jpg'> 
</Shot> 

10 

<Shot 

DC.Identifier = 'Shot3' 
DC. Description = 'Interaction with humans' 
DC.Coverage.T.Start = '4:02.14' 
15 DC.Coverage.T.End = '5:10.25' 

keyFrame = 'http://www.cisra.com.au/MediaBrowser/examples/content 
/AusWild883_KF3.jpg'> 

</Shot> 

20 </Scene> 
<Scene 

DC. Identifier = 'Scene 2' 
DC. Description = "Natural habitat' 
DC.Coverage.T.Start = '7:15.25' 
25 DC.Coverage.T.End = '3:05.20'> 

</Scene> 
<Scene 

DC. Identifier = 'Scene 3' 
DC. Description = 'Mating scene' 
30 DC.Coverage.T.Start = '10:20.45' 

DC.Coverage.T.End = '2:02.05'> 
</Scene> 

<A/ideoDocument> 



535245.doc 



-63- 



Appendix 2 

<?xml version= , 1.0 , ?> 

<!- (C) Copyright Canon Information Systems Research Australia (CISRA) 2000 — > 
5 ■<!— All rights reserved — > 

<!- XSLT 1.0 -> 

<xsl:stylesheet 

10 xmlns:xsl= , http://www.w3.org/1999/XSLyTransform , 

xmlnsrmb = 'http://www.cisra.cx3m. au/MediaBrowser' 

xmlns:xlink = , http://www.w3.org/1 999/xlink' 

version = '1.0'> 
<xsl:output 
15 method = 'xml' 

version = '1.0' 

standalone = 'yes' 

indent = 'yes'/> 

20 <xsl template match = VideoDocument'> 
<VideoDocument 

xmlnsrmb = 'httpr//cisra.com.au/MediaBrowser' 
xmlnsrxlink = 'http://www.w3.org/1999/xlink' 
mbrdescriptorType = TOC 
25 mb:textualldentifier = '{©DC.Title}' 

mb:updateable = 'false 1 
xlink:type = 'simple' 
xlinkrrole = 'resource' 
xlinkrhref = '{@href}'> 
30 <DC.Title 

mb:descriptorType = 'Index' 
mbrtextual Identifier = Title'> 
<xsl:value-of select = '@DC.Title'/> 
</DC.Title> 
35 <DC.Creator 

mbrdescriptorType = 'Index' 
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mbitextualldentifier = 
<xsl:value-of select = 

</DC.Creator> 

<DC.Subject 

mb:descriptorType = 
mbitextualldentifier = 
<xsl:value-of select = 

</DC.Subject> 

<DC.Type 

mb:descriptorType = 
mb:textualldentifier = 
<xsl:value-of select = 

</DC.Type> 
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, Creator J > 
'@DC.Creator7> 



•Index 1 

'Subjects 

'@DC.Subject7> 



'Index 1 

Type'> 

•@DC.Type7> 



15 <!- Now push all scene children -> 

<xsl:apply-templates select = 'Scene'/> 
</VideoDocument> 
</xsl:template> 

20 <xsl: template match = 'Scene'> 
<Scene 

mb:descriptorType = TOC 
mbitextualldentifier = '{©DC. Identifier} 1 
mb:id = '{@DC. Identifier}' 
25 xlink:role = 'resource' 

xlink:href = , {/A/ideoDocument/@href}#avptr(time::{@DC.Coverage.T.Start} l 

{@DC.Coverage.T.End})*> 
<DC.Description 

mb:descriptorType = 'Index' 
30 mbitextualldentifier = 'Description^ 

<xsl:value-of select = '@DC.Description'/> 
</DC.Description> 



35 



<!— Now push all scene children — > 
<xsl:apply-templates select = 'ShotV> 
</Scene> 
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</xsl:template> 

<xsl template match = 'Shot'> 
<Shot 

5 mb:descriptorType = TOC 

mb:visualldentifier = '{@keyFrame}' 
mb:id = '{©DC. Identifier} 1 
xlinkirole = 'resource 1 

xlink:href = , {/A/ideoDocument/@href}#avptr(tirne::{@DC.Coverage.T.Start}, 
10 {@DC.Coverage.T.End})'> 
<DC.Description 

mb:descriptorType = Index 1 
mb:textualldentifier = 'Description^ 
<xsl:value-of select = , @DC.Description , /> 
15 </DC.Description> 

</Shot> 
</xsl:template> 

20 </xsl:stylesheet> 
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CLAIMSs Th e claims defining the invention are as follows: 

1. A method of facilitating access to multimedia content distributed over a computer 
network, said method comprising the steps of: 

at a server within said network, monitoring location recording activities of 
5 subscribers of said server to form a ranked list of multimedia locations recorded by said 
subscribers; 

at a browser coupled to said server, receiving search criteria relating to said 
multimedia content entered by a subscriber; and 

applying said search criteria to said ranked list to provide said subscriber with 
10 network identifiers for said multimedia content corresponding to those said recorded 
locations best satisfying said search criteria. 

2. A method according to claim 1 wherein said search criteria is applied to an 
entirety of said ranked list. 

15 

3. A method according to claim 1 wherein said search criteria is applied to a subset 
of the network identifiers comprising said ranked list. 

4. A method according to claim 3 wherein said search set comprises a first 
20 predetermined number of highest ranked network identifiers from said list. 

5. A method according to claim 4 further comprising the steps of randomly 
identifying from lower ranked ones of said network identifiers of said list a second 
predetermined number of said network identifiers, and substituting said second number of 

25 network identifiers into said search set, said second number being less than said first 
number. 

6. A method according to claim 5 wherein said second number of network 
identifiers are randomly substituted into said search set. 

30 

7. A method according to claim 5 or 6 wherein said random selection of said lower 
ranked network identifiers is influenced by a subscriber selected parameter. 
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8. A method according to claim 7 wherein said parameter determines said second 
number of said network identifiers. 

9. A method according to claim 8 wherein said random selection comprises, for each 
5 of said second number, selecting a rank position from said list and selecting a lower 

ranked one of said network identifiers to be substituted into said search set. 

10. A method according to any one of the preceding claims wherein said network 
identifier identifies an item of metadata, wherein said metadata represents one of a 

10 description of said item of multimedia content or a description of a collection of 
multimedia content. 

11. A method according to any one of the preceding claims wherein said network 
identifiers represent descriptions of collections of multimedia content, in which individual 

15 said descriptions of said content are accessible via branch network identifiers depending 
from other said network identifiers. 

12. A method according to claim 11 wherein said network identifiers are selected 
from the group consisting of a URI for an XML description, and a URI for a metadata 

20 server root. 

13. A method according to any one of the preceding claims further comprising, at 
said server, examining a corresponding ranked list at another server within said network 
and modifying said ranked list with locations obtained from said corresponding ranked list. 

25 

14. A method of forming a searchable list of computer network locations, said 
method comprising the steps of: 

monitoring, at a server to a plurality of network subscribers, bookmarking 
activities of said subscribers; 
30 forming a list of network identifiers bookmarked by individual ones of said 

subscribers; and 

ordering said network identifiers in said list according to a frequency of 
bookmarking by said subscribers. 
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15. A method according to claim 14 further comprising: 

monitoring accesses by individual ones of said subscribers to said network 
identifiers within said list to modify said frequency; and 

re-ordering said list according to said modified frequency. 

5 

16. A method of facilitating access to multimedia content distributed over a computer 
network, said method comprising, at a server within said network, the steps of: 

providing an application for subscribing users thereof to enable said users to 
access multimedia content, each item of said multimedia content being identified by a 
10 network identifier; 

monitoring individual said subscribing users' intent to use or store items of said 
multimedia content; and 

in response to said monitoring, forming a ranked list of network identifiers 
relating to said items for use in facilitating access to said multimedia content for a 
15 predetermined set of said subscribing users, wherein an ordering of said list is determined 
by a frequency that the corresponding said network identifier is referenced with intent to 
use or store by said subscribing users. 

17. A method according to claim 16 wherein said predetermined set of said 
20 subscribing user comprises all said subscribing users. 

18. A method according to claim 16 wherein said network identifier comprises a 
uniform resource identifier. 

25 19. A method according to claim 16 wherein said network identifier comprises a 
uniform resource location. 

20. A method according to claim 16 wherein each said item is identified directly by 
said network identifier. 

30 

21. A method according to claim 16 wherein each said item is reference indirectly by 
said network identifier. 
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22. A method of facilitating access to multimedia content at locations distributed over 
a computer network, substantially as described herein with reference to Fig. 18 of the 
drawings. 

23. A browser application for performing the method of any one of claims 1 to 22. 

24. A server application for performing the method of any one of claims 1 to 22. 

25. A computer system incorporating an application according to claim 23 or 24. 

26. A browser application for operation upon a subscriber terminal, said application 
comprising means for signalling to a server to which said application couples, a 
bookmarking of a location accessed by said application, said location being within a 
computer network incorporating said server. 

27. A server operating within a computer network, said server having at least one 
user browser application associated therewith and via which said user browser application 
accesses locations within said network, said server comprising: 

means for receiving from said user browser application, bookmark information 
relating to a location recorded for subsequent access by said user browser application; 

means for integrating said bookmark information received from plural ones of 
said user browser applications to form a database of said bookmark information. 

28. A server according to claim 27 further comprising: 

search engine means for searching said bookmark information. 

29. A server according to claim 27 or 28 wherein said bookmark information 
comprises a location within said network of an source of multimedia content. 

30. A server according to claim 29 wherein said bookmark information further 
comprises a count of a number of accesses to said location. 

31. A computer readable medium incorporating a computer program for enabling a 
computer apparatus to facilitate access to multimedia content at locations distributed over 
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a computer network, said computer program comprising an application according to any 
one of claims 23, 24 or 26. 



Dated this Twenty-second day of December 2000 
CANON KABUSHIKI KAISHA 
Patent Attorneys for the Applicant 
SPRUSON&FERGUSON 
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SOURCE DESCRIPTION 
(see Appendix 1) 
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VMeoDocumen^ 
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1510 



Scene 



3 



1512 



(^Vid 



DC.Title=*Survival of the Platypus* 
DC.Creator='John Smith' 
DC.Subject=*Australian Wildlife 1 
DC.Type='Digital Video* 
href^'http://../AusWild883.mpg' 



1502 



DC.Identifier='Scene V 
DC.Description='Taronga Park Zoo' 
DCCoverage.T.Start='2:05.00' 
DC.Coverage.T.End='5: 10.25' 



Scene 
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DC.Identiftei='Shot 1 ' 
DC.Description='Mature male platypus 
DC.Coverage.T.Start='2:05.00' 
DC.Coverage.T.End='2.55.20' 
keyFrame='http://../KFl .jpg' 
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Frame 
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DC. Identifier^' Frame 1' 
DC.Description^'Key frame* 
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XSLT (see stylesheet in Appendix 2) 



NORMALISED DESCRIPTION Vldec f) urr en 
1590- — \ 1530 



mb:descriptorType='TOC' 
mb:textualldentifier= {DC.Title} 
mb: updateable=' false ' 
xlink:role='resource' 
xlink:href i ={href} 
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mb :descriptorType=' Index ' 
mb:textualIdentifier='Title* 



DC.Title 
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mb:descriptorType= ' Index * 
mb:textualldentifier= ' Description * 



DC .Description 
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mb :descriptorType= *TOC ' 
mb : textualldentifier= { DC. Identifier } 
mb:id= {DC. Identifier} 
xlink:role=' resource' 
xlink:href='http://../ AusWild883.mpg 
.#avptr(time::2:05.00,5: 10.25)' ^ 
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mb:descriptorType='TOC' 
mb: textualldentifier= { DC. Identifier } 
mb:visualldentifier= 

'http://mySite/KFl jpg' 
mb:id={DC.Identifier} 
xlink:role='resource' 
xlink:href='http://../AusWild883.mpg 
.#avptr(time::2:05.00,2:55.20)' ^ 
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mb: descriptorType= ' Index ' 

mb: textual Identifier ' Description ' 
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